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Preface 

o 
o 

(N 
Oh; 

' When I was asked by K. K. Phua to do a book for World Scientific based on my 

work, he suggested a volume of essays or a reprint volume. I have decided to combine 
these two suggestions into one, by preparing a reprint volume with commentaries. 
Some of the commentaries are drawn from historical articles that I have written 
^ ■ for publication, others are drawn from unpublished historical accounts written for 

. institutional archives, and yet others have been written expressly for this volume. In 

the commentaries, I try to relate the reprinted articles to the time-line of my career, 
' and at the same time to analyze their relations with the work of other physicists 

■ whose work influenced mine and vice versa. 

. In keeping with these dual aims, I have arranged the articles and the commen- 

I taries in approximately chronological order, but occasionally deviate from strict 

Q^' chronology in order to group topically related articles together. In choosing which 



articles to include, I have been guided by two generally coinciding measures, my own 



D . estimate of significance, and the citation count. However, in occasional cases I have 

' included infrequently cited articles where I felt that there was an interesting related 

story to tell. Often, when finishing a line of work, I have written a long summarizing 
^ ■ article or review; some of these are too long to be included in their entirety, and so I 

have included in the reprints only the sections most relevant to the narrative in the 
commentaries. Similarly, I have not included among the reprints the summer school 
lectures I have given on current algebras, anomalies, and neutrino physics, but refer- 
ences to them appear in the commentaries. In the last decade, I have published two 
books related to my work on generalized forms of quantum mechanics, and included 
many research results directly in these books in lieu of first writing papers. It is 
feasible to give only brief descriptions of these projects in the commentaries; I have 
included just a few papers from this period, all in the nature of follow-ons to the 
first book. 

In both the texts of the commentaries and the reference lists that follow them, 
reprinted articles are identified by a sans serif R, so that for example, Rl designates 
the first reprinted article. Numbers in square brackets following each reference in 
the reference lists give the pages in the commentaries where that reference is cited. 
There is also an index of names following the commentaries, and a list of detailed 
chapter subheadings in the Table of Contents. 

I wish to thank Tian-Yu Cao for a critical reading of the commentaries and much 
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helpful advice, Alfred Mueller for a helpful conversation on renormalon ambiguities, 
Richard Haymaker for a clarifying email on dual superconductivity parameters, and 
William Marciano, Robert Oakes, and Alberto Sirlin for calling my attention to 
relevant references. I also wish to thank the following people for sending me help- 
ful comments on the initial draft of the commentaries after it was posted on the 
archive as hep-ph/0505177: Nikolay Achasov, Dimi Chakalov, Christopher Hill, Ro- 
man Jackiw, Andrei Kataev, Peter Minkowski, Herbert Neuberger, and Lalit Sehgal. 
I am grateful to Antonino Zichichi for permission to use the quote from Gilberto 
Bernardini in Chapter 2, to Mary Bell for permission to use the quote from John 
Bell in Chapter 3, to James Bjorken for permission use his quote in Chapter 3, and 
to Clifford Taubes for helpful email correspondence and permission to use his quotes 
in Chapter 7. 

My editor at World Scientific, Kim Tan, has given valuable assistance throughout 
this project. Miriam Peterson and Margaret Best have patiently assisted in the 
conversion of my TeX drafts to camera-ready copy and with indexing, the latter a 
task that was shared with Lisa Fleischer and Michelle Sage. I am also indebted to 
Momota Ganguli and Judy Wilson-Smith for bibliographic searches, to Christopher 
McCafferty and James Stephens for help with computer problems, and to Marcia 
Tucker and Herman Joachim for assistance, respectively, in scanning and duplicating 
certain of the papers to be reprinted. Finally, I wish to express my appreciation to the 
Institute for Advanced Study (abbreviated throughout the commentaries as IAS) for 
its support of my work, first from 1966 to 1969, when I was a Long Term Member, 
and then from 1969 onwards, when I have been a member of the Faculty, in the 
School of Natural Sciences. My work has also been supported by the Department of 
Energy under Grant No. DE-FG02-90ER40542. 

In addition to the publishers acknowledged on each individual reprint, I also 
wish to thank World Scientific for the use of material originally prepared for their 
volumes commemorating the 50th anniversary of Yang-Mills theory. Chapter 3 on 
anomalies is largely based on an essay I contributed to 50 Years of Yang-Mills The- 
ory, edited by G. 't Hooft, and the parts of Chapters 7 and 9 dealing respectively 
with monopoles and projective group representations are based on an essay I wrote 
for a projected companion volume on the influence of Yang-Mills theory on math- 
ematics. Also, some material in Chapters 2 and 3 overlaps with the contents of a 
letter on antecedents of asymptotic freedom that I wrote to Physics Today, which 
appears in the September, 2006 issue. 
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1. Early Yeeirs, and Condensed Matter Physics 



A brief synopsis of my career appears in an article that I wrote recently for the Abdus 
Salam International Centre for Theoretical Physics (Adler, 2004, Rl), which includes 
a description of events when I was young that led to my becoming a theoretical 
physicist. The focus of this article is on the career path that led to my work in high 
energy physics. However, before I ptiblishcd anything in high energy theory, I spent 
several summers working in industrial research laboratory jobs in condensed matter 
physics, and it was this work that led to my first scientific publications. 

By the end of my junior year at Harvard, I had taken courses in quantum me- 
chanics and also in condensed matter physics (then called solid state physics). With 
this background, during the summer of 1960, I got a job working for Joseph Birman, 
who at that time (before going on to Professorships at New York University and then 
City College of the City University of New York) headed a section studying electrolu- 
minescence at the General Telephone and Electronics (GT&E) Research Laboratory. 
This industrial research laboratory, formerly the Sylvania Research Laboratory, was 
conveniently located a few miles from where my family lived in Bayside, Queens. I 
had a desk in an office looking out over the entrance to the Long Island Sound, from 
which I could see sections of roadway being hoisted into place on the Throgs Neck 
Bridge, then under construction. 

During my first weeks at GT&E, Joe got me started learning some basic group 
theory as applied to crystal structures, and then suggested the problem of using these 
group theory methods to check a formula that Hopfield (1960) had given relating 
band theory structures in hexagonal and cubic variants of zinc sulfide (ZnS) and 
related compounds, substances that Joe had been studying (Birman, 1959) with an 
eye to electroluminescence applications. This turned out to be basically a technical 
exercise and confirmed Hopfield's results. In the course of this work, which I finally 
wrote up a year later (Adler, 1962a, R2), I also attempted an a priori estimate 
of a parameter determined by experimental fits to the Hopfield formula. This got 
me interested in the Ewald sum method for doing crystal lattice sums, on which 
I wrote a paper (Adler, 1961) giving generalized results for sums over lattices of 
functions f {r)Yi^^{0 , (f)) , with a spherical harmonic and /(r) a radial function 
representable as a transform by /(r) = ex.p{—r'^t)g{t)dt. These two pieces of 

work stemming from my summer at GT&E were my first scientific publications. 
With Joe's encouragement, I also gave a 10 minute contributed paper (Adler and 
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Birman, 1961) on the ZnS work at the New York meeting of the American Physieal 
Society the fohowing winter, while I was back home on inter-term break from college. 
Since this was my first conference talk, I typed out a text and went over it so many 
times that I knew it by heart. After my talk, Joe said words to the effect, "That 
was fine, but next time you give a talk don't sound like it was memorized" , wisdom 
that I have taken to heart on many subsequent occasions! 

When I returned to Harvard for my senior year I was told by some of the fac- 
ulty that Henry Ehrenreich from the General Electric (GE) Research Laboratory 
was on leave at Harvard that year, and was giving the graduate course on solid 
state physics, covering substantially different material from what I had heard the 
year before. I attended Henry's lectures, which included a calculation of the en- 
ergy and wave-number dependent dielectric constant in isotropic solids, using the 
self-consistent field or energy-band approximation, along the lines of the treatment 
given in Ehrenreich and Cohen (1959). I got to know Henry outside the classroom 
as well, and he invited me to work at the GE Research Laboratory in Schenectady, 
NY the following summer, after my graduation from college in June 1961. This was 
appealing in a number of ways, since my family had moved to Bennington, VT the 
year before, about an hour's drive away from Schenectady, and so I was able to 
drive home for a visit on weekends. At GE, Henry suggested that I generalize the 
treatment of the dielectric constant that he and Cohen had given so as to include 
various effects of interest in real solids. In the paper that resulted (Adler, 1962b, 
R3), I calculated the full frequency and wave- number dependent dielectric tensor in 
the energy-band approximation, including tensor components that couple longitu- 
dinal and transverse electromagnetic disturbances, which are absent in the isotropic 
approximation but are present even in solids with cubic symmetry. The longitudi- 
nal to longitudinal component of the general dielectric tensor reduces to the result 
obtained by Ehrenreich and Cohen when various identities (reflecting charge conser- 
vation and gauge invariance, as well as symmetries) are used. I also gave a method, 
based on an analysis of "Umklapp" processes that couple wave numbers differing by 
a reciprocal lattice vector, together with use of a multipole expansion, for calculating 
local field corrections to the dielectric constant, giving a modified Lorenz-Lorentz 
formula. (Local field corrections were also studied by Cohen's student Nathan Wiser 
(1963) by a different method.) My paper on the dielectric constant in real solids has 
been widely cited in the subsequent condensed matter literature, reflecting its rele- 
vance for spectroscopic studies of solids, as well as its generalizations to nonlinear 
dielectric behavior. 

Although I had decided to focus on elementary particle theory for my graduate 
study in Princeton, I retained an interest in solid state physics, and returned to 
GE for half of the summer of 1962 to work again with Henry Ehrenreich, this time 
publishing a paper (Adler, 1963) in which I applied the dielectric constant results of 
the previous summer to the theory of hot electron energy loss in solids. Not long after 
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this visit, Henry left GE to take a Professorship at Harvard, where our paths crossed 
again during my postdoctoral years. After finishing my PhD at Princeton in 1964, I 
spent the summer working at Bell Telephone Laboratories in Murray Hill, under the 
supervision of Phil Anderson and Dick Werthamer. However, aside from informal 
notes on the application of raising and lowering operators to the vortex structure in 
type H superconductors, my principal publication resulting from this final industrial 
summer job was a writeup of my work on PCAC consistency conditions, which I 
will discuss in the next chapter. 

References for Chapter 1 

• Adler, S. (1961). A Generalized Ewald Method for Lattice Sums. Physica 
27, 1193-1201. 

• Adler, S. L. (1962a) R2. Theory of the Valence Band Splittings at /c = in 
Zinc-Blende and Wurtzite Structures. Phys. Rev. 126, 118-122. 

• Adler, S. L. (1962b) R3. Quantum Theory of the Dielectric Constant in Real 
Solids. Phys. Rev. 126, 413-420. 

• Adler, S. L. (1963). Theory of the Range of Hot Electrons in Real Metals. 
Phys. Rev. 130, 1654-1666. 

• Adler, S. L. (2004) Rl. Prom Elements of Radio to Elementary Particle 
Physics, in One Hundred Reasons to be a Scientist (The Abdus Salam Inter- 
national Centre for Theoretical Physics, Trieste), pp. 25-26. 

• Adler, S. and J. L. Birman (1961). An LCAO Theory of the k = 0,0,0 
Valence Band Splittings in Zinc Blende and Wurtzite Structures. Bull. Am. 
Phys. Society Series II, Vol. 6, No.l, Part 1, p. 22. 

• Birman, J. L. (1959). Simplified LCAO Method for Zincblende, Wurtzite, 
and Mixed Crystal Structures. Phys. Rev. 115, 1493-1505. 

• Ehrenreich, H. and M. H. Cohen (1959). Self-Consistent Field Approach to 
the Many-Electron Problem. Phys. Rev. 115, 786-790. 

• Hopfield, J. J. (1960). Fine Structure in the Optical Absorption Edge of 
Anisotropic Crystals. J. Phys. Chem. Solids 15, 97-107. 

• Wiser, N. (1963). Dielectric Constant with Local Field Effects Included. 
Phys. Rev. 129, 62-69. 
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2. High Energy Neutrino Reactions, PCAC Relations, 

and Sum Rules 



Introduction 

By the end of my undergraduate years at Harvard (1957-1961), I had gone through 
most of the graduate course curriculum, as well as a senior year reading course orga- 
nized by Paul Martin for my classmate Fred Goldhaber and me. This course gave me 
an introduction to quantum field theory, or more precisely, to quantum electrody- 
namics, through some of the seminal papers appearing in the reprint volume edited 
by Schwinger (1958). Although as a result of my summer research jobs I could have 
gone on relatively easily to a PhD in solid state physics, I wanted to enter particle 
physics, and moreover wanted exposure to styles of theoretical physics different from 
those I had seen already at Harvard. Hence I decided on Princeton for my graduate 
work (with strong encouragement from Harvard faculty member Frank Pipkin, who 
was an enthusiastic Princeton graduate alumnus), and enrolled there in the fall of 
1961. 

My first year there was spent preparing for general exams, mostly by reading. I 
also participated in a seminar organized by the graduate students, which surveyed 
many aspects of dispersion relations and covered some topics in Feynman diagram 
calculations as well. The only formal course I took was one given by Sam Treiman, 
which gave an introductory survey to elementary particle physics. I was impressed by 
the clarity of his approach, and both because of this and because Murph Goldberger 
was planning a sabbatical leave the following year, I asked Treiman to take me on 
as a thesis student. 

This turned out to be a fortunate choice. Treiman proposed that I do a thesis in 
the general area of high energy neutrino reactions, which was just then emerging as 
an area of phcnomcnological interest. After doing a survey of the literature in the 
field, I first did a "preliminary problem" of calculating the final lepton and nucleon 
polarization effects in the quasielastic neutrino reaction i/£ + N ^ £ + N, with all 
induced form factors retained in the vector and axial- vector vertices (Adler, 1964a). 
I did this calculation in two ways, first by using the covariant form of the matrix 
element and Dirac 7 matrix algebra, then by using the center of mass form and 
Pauli matrix algebra, and directly checked the equivalence of the two forms of the 
answer. This convinced Treiman that I could calculate, and incidentally introduced 
me to the axial-vector current and coupling gA which were to be central to my work 
for many years. 
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High Energy Neutrino Reactions, PC AC Relations, and Sum Rules 9 

After this calculation was completed, I decided to make the main focus of my the- 
sis a calculation of the simplest inelastic high energy neutrino reaction, that of pion 
production in the (3,3) or A(1232) resonance region. This problem had the appeal 
of having as a paradigm the beautiful dispersion relations calculation of pion pho- 
toproduction of Chew, Goldberger, Low, and Nambu (1957), which was one of the 
classics of the dispersion relations program. An extension to electroproduction had 
already been carried out by Fubini, Nambu, and Wataghin (1958), but they had 
done no numerical work, and on closer examination their matrix element turned 
out to be divergent at zero hadronic momentum transfer ub when the lepton four- 
momentum transfer squared denoted by (or k'^) is nonzero. There were similar 
problems (surveyed in my thesis) with the other papers then available dealing with 
pion electroproduction or weak production, so doing a complete and careful calcu- 
lation, including numerical evaluation of the cross sections, seemed a good choice of 
thesis topic. It was also a demanding one; although I wrote my thesis and got my 
degree in 1964, my goal of a complete calculation, including the necessary computer 
work, was not achieved until 1968. 

Much of the delay though, was a result of the fact that weak pion produc- 
tion turned out to be a marvelous theoretical laboratory for studying the impli- 
cations of conservation hypotheses for the weak vector and axial-vector currents, 
and this became a parallel part of my research program, as reflected in the title of 
my thesis "High Energy Neutrino Reactions and Conservation Hypotheses" (Adler, 
1964b). Prom Treiman and from my reading, I had learned about the Feynman- 
Gell-Mann (1958) proposal of a hadronic conserved vector current (CVC), and I 
had also learned about the Goldbcrgcr-Trciman (1958) relation for the charged 
pion decay constant, which they had discovered through a pioneering dispersion 
theoretic calculation of the weak vertex. A simplified derivation of this relation had 
already been achieved through the suggestion of Nambu (1960), Bernstein, Fubini, 
Gell-Mann, and Thirring (1960), Gell-Mann and Levy (1960), and Bernstein, Gell- 
Mann, and Michel (1960), that the axial- vector current is partially conserved, in the 
sense that the divergence of the axial-vector current behaves at small squared mo- 
mentum transfer as a good approximation to the pion field, or equivalently, is pion 
pole dominated. (Much later on, after contacts with China resumed, I learned that 
Chou (1960) had given a similar simplified derivation of the Goldberger-Treiman 
relation, as well as further applications to decay processes.) The partial conservation 
hypothesis was an appealing one, but as Treiman kept emphasizing, it was supported 
by "only one number" and therefore had to be regarded with caution. So a second 
goal of my thesis work ended up being to keep an eye out for other possible tests of 
the conservation hypotheses for the weak vector and axial-vector currents. 

Before going on to discuss how these emerged from my weak pion production 
calculation, let me first recall what I knew when I started the thesis work. The first 
chapter of the thesis (written in the spring of 1964) was a theoretical survey; in 



February 2, 2008 2;58 Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics archive4 



10 Adventures in Theoretical Physics 

the section headed "Partially Conserved Axial Vector Current (PCAC)" I referred 
only to the papers of Goldberger and Trcinian, of Nambu, of Bernstein et al., and 
of Gell-Mann and Levy cited in the preceding paragraph. In the final section of 
the first chapter, entitled "Survey of Computations Relating to Specific Reactions" 
there is the following reference to the paper of Nambu and Shrauner (1962), which 
was my reference 37: "An entirely different approach to weak pion production in 
the low pion-energy region has been pursued by Nambu and Shrauner. These 
authors assume that the weak interactions are approximately 75 invariant ( "chirality 
conservation" ) . They then obtain formulas for production of low energy pions, in the 
approximation in which the pion mass is neglected, in analogy with the treatment 
of low energy bremsstrahling (sic) in electron scattering." At the time I started my 
calculations, neither Treiman nor I understood the relation between the Nambu- 
Shrauner work and the issue of partial conservation of the axial- vector current. This 
was partly because we were suspicious of the assumption of zero pion mass, and 
partly because the Nambu-Shrauner paper makes no reference to the axial-vector 
coupling gA, so it was not clear whether their "chirality" was related to the weak 
currents I was studying in my thesis. This second point is particularly significant, 
and I will return to it in considerable detail below. I was not able to determine from 
my files (by finding either a reference in my notes or a Xerox copy) when I first read 
the Nambu-Lurie (1962) paper on which the Nambu-Shrauner paper was based, 
but it was probably a year later, in early 1965. 



Forward Lepton Theorem 

Roughly the first year and a half of my thesis work on weak pion production was 
spent mastering the formal apparatus of Lorentz invariant amplitudes (used for writ- 
ing dispersion relations) and center of mass multipole expansions (used for imple- 
menting unitarity) and the transformations between them, the Born approximation 
structure, cross section calculations, etc. Then in the winter of 1963-1964 or the 
spring of 1964 (I can only establish dates approximately by the sequence of folders, 
since I did not date them), I began noticing things that transformed a hard and 
often dull calculation into a very interesting one (just in the nick of time, since I 
was due to finish in June of 1964 and had already accepted a postdoctoral position 
at the Harvard Society of Fellows starting in the fall semester.) 

The first thing I noticed was that at zero squared leptonic four momentum trans- 
fer, my expression for the weak pion production matrix element reduced to just the 
hadronic matrix clement of the divergence of the axial- vector current, which by the 
partial conservation hypothesis is proportional to the amplitude for pion-nucleon 
scattering. I then tried to abstract something more general from this specific ob- 
servation, and soon had a neat theorem showing that in a general inelastic high 
energy neutrino reaction, when the lepton emerges forward and the lepton mass is 
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neglected, the leptonic matrix element is proportional to the four momentum trans- 
fer; hence when the leptonic matrix element is contracted with the hadronic part, 
the vector current contribution vanishes by CVC, and the axial- vector current con- 
tribution reduces by partial conservation (for which I coined the parallel acronym 
PCAC, which has become standard terminology) to the corresponding matrix ele- 
ment for an incident pion. Thus inelastic neutrino reactions with forward leptons 
can be used as potential tests of CVC and PCAC; this became a chapter of my 
thesis and was written up as a paper (Adler, 1964c, R4) as soon as my thesis was 
completed. The paper on CVC and PCAC tests was the first of three papers in 
which I found connections between high energy neutrino scattering reactions and 
properties of the weak currents; the other two were my long paper on the gA sum 
rule, and a paper on neutrino reaction tests of the local current algebra, both of 
which are reprinted in this volume and will be discussed shortly. 

To determine whether the CVC/PCAC test could be implemented experimen- 
tally, I wrote a letter to the neutrino experimentalists at CERN. After a few months 
I received a charming reply from Gilberto Bernardini, who commented "The delay 
of this answer, for which I apologize very much, is due to two facts. The first is the 
known time diagram of the 'modern physicist'. In case you do not know it yet, I plot 
it here: (Diagram with a vertical time axis and an upwards pointing arrow; 'work' 
at the bottom, 'travel & meetings' in the middle, and 'dinners &: ceremonies' at the 
top.) Unfortunately, according to my age, I am already very much in the central 
region and even higher." Bernardini then went on to say that Antonino Zichichi had 
brought my paper to his attention a couple of weeks before, and then continued with 
an analysis of technical problems in executing my proposal. There followed a further 
exchange of letters with Bernardini, with theorist John Bell, and with experimen- 
talists Guy von Dardel and Carlo Pranzinetti. Of particular note, von Dardel wrote 
me a long letter after he read my paper, remarking that the care with which he 
read it was partly due to a skiing accident that had kept him in bed with a broken 
leg and nothing better to do, and giving a formula that he had worked out, during 
his enforced time away from experimental activities, for corrections to my theorem 
when the lepton emerges at a small angle to the forward direction. This formula 
turned out to be not quite right (there was an incorrect energy factor), but started 
me thinking about the issue, which I discussed with John Bell when I attended an 
Informal Conference on Experimental Neutrino Physics at CERN, January 20-22, 
1965. Bell had redone the calculation of the pion exchange contribution to the small 
angle correction by splitting the amplitude into spin-flip and non-spin-flip parts, 
getting a result that turned out also to be not quite right (there was a factor of 2 off 
in one term). When I got back to Harvard I repeated the calculation, according to 
my notes, by the "Bell method" , and also by a covariant method, and got a formula 
that I never published, but conveyed in letter of Feb. 10, 1965 to Bell (with copies 
to Bernardini, Block, von Dardel, Faissner, Pranzinetti, and Veltman, most of whom 



February 2, 2008 2;58 Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics archive4 



12 Adventures in Theoretical Physics 

I had talked with when I was at CERN). The corrected small angle formula states 
that the first factor on the second line of Eq. (16) of R4 should be replaced by 



2u n2 



2fc2o(A:2 + M2) 



+ 



2(A;2 + M2) 



with k'^ = m|fco/fc2o + ^10^20^^ the Icptonic four-momentum transfer squared and 
with 6 the lepton-neutrino polar angle, assuming that the lepton-neutrino azimuthal 
angle has been averaged over. 

Even before my visit to CERN, Bell (1964) had noted that when one considers 
my forward lepton formula in the context of nuclei, "the following difficulty presents 
itself: Because of absorption, pion cross sections depend on the size of large nuclei 
roughly as A^^^. But neutrinos penetrate to all parts of nuclei; for them cross sec- 
tions should contain at least a part proportional to A. This indicates for large nuclei 
a critical dependence of (t{W, —q^) on g^." Bell proceeded to use optical model meth- 
ods to discuss this "shadowing effect", which has continued to be of interest over 
the years. It took many years for my forward lepton formula, and Bell's shadowing 
observation, to be experimentally verified; for a survey of the status of both, and 
further references, see the recent conference talk by Kopeliovich (2004). An earlier 
review of Mangano et al. (2001) also discusses the experimental status of shadow- 
ing, and a good exposition of the theory is given in the review of Llewellyn Smith 
(1972). For specific applications of the forward lepton formula to exclusive chan- 
nels, see Ravndal (1973) and Rein and Sehgal (1981) for A(1232) production, and 
Faissner et al. (1983) for coherent vr" production (which was used to determine the 
coupling strength of the isovector neutral axial- vector current). Also, Sehgal (1988) 
and Weber and Sehgal (1991) discuss an interesting analog of the forward lepton 
theorem for purely leptonic neutrino-induced reactions. 



Soft Pion Theorems 

Returning now to my thesis work in the spring of 1964, the second thing that I 
noticed, again working from my explicit expression for the weak pion production 
amplitude, was that when I imposed the PCAC condition at zero values of the 
hadronic energy variable v and the hadronic momentum transfer variable ub, only 
the Born approximation pole term coming from the nucleon intermediate state con- 
tributed; all of the model dependent parts of the weak amplitudes dropped out. 
Thus I got what I called a "consistency condition" on the pion-nucleon scattering 
amplitude A^^^+\ implied by PCAC, taking the form 

gl/M = A'^^(+)(z/ = 0,UB = 0, k"" = 0)/K^^'^(A;2 = Q), 

with gr the pion-nucleon coupling constant, M the nucleon mass, —k"^ the squared 
mass of the initial pion (the final pion is still on mass shell), and with K^'^'"{0) the 
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pionic form factor of the nucleon, normalized so that K^^'^{—M'^) = 1. This seemed 
absolutely remarkable, and I immediately proceeded to do a dispersion relation 
evaluation of the pion-nucleon amplitude on the right, using the Roper (1964) phase 
shift analysis as input, and assuming that the effects of off-shell continuation in 
as in K ) were small. In setting up this calculation, I used several 
theoretically equivalent ways of writing the subtracted dispersion relation to get an 
estimate of the errors in the analysis. The Christenson-Cronin-Fitch-Turlay (1964) 
experiment on CP violation had a substantial block of computer time reserved for 
analysis, and courtesy of them I was able to use a small amount of their time to 
run my programs, a few days before I was scheduled to give a talk at Columbia. I 
recall staying up all night to get the job done, and at one point, in the wee hours 
of the morning, dropping my deck of cards and then having to spend precious time 
getting them back in the proper order. But I did get my calculation done by morning 
(and never again attempted an "all-nighter".) The relation worked very well, and 
as Trciman later said, "now there is a second number"; PCAC was starting to look 
interesting. This work became the final chapter of my thesis. 

Immediately after finishing my thesis I took a summer job at Bell Laborato- 
ries at Murray Hill, nominally working for Phil Anderson. I wanted to learn about 
superconductivity, and Phil assigned me to work for Dick Werthamer. I did learn 
about the BCS and Ginzburg-Landau theories, and Abrikosov vortices in type-II 
superconductors, but I did not succeed in my project with Dick, which was to try 
to understand the resistance to vortex line motion using thermal Green's functions. 
A few weeks before the end of the summer, I asked for and got Phil's permission to 
spend some time writing a paper on the pion-nucleon consistency condition (Adler, 
1965a, R5), which I also then extended to pion-pion and pion-lambda scattering. In 
the pion-pion case, since there are no pole terms, the consistency condition takes the 
form that the pion-pion scattering amplitude with one zero mass pion, evaluated at 
the symmetric point s = t = u = M^, is zero. This was the first example of a soft 
pion zero or, as termed in the literature, "Adler zero", in non-baryonic amplitudes, 
that I will return to shortly. Knowing that I was planning to go on in particle theory, 
Phil told me one day that he had an interesting paper to show me, which had just 
been submitted to the journal Physics which he was editing. It was Gell-Mann's 
(1964) paper on current algebra; Phil let me read it, but not Xerox it. This was to 
prove decisive for my work on sum rules nine months later. My interactions with 
Phil however were brief, and never touched on the subject of symmetry breaking in 
superconductivity and particle physics, on which Phil had written a paper (Ander- 
son, 1963) that I learned of only many years later, that was a forerunner of work on 
the "Higgs mechanism" for giving masses to vector bosons. 

In the fall of 1964 I moved to Harvard as Junior Fellow in the Society of Fel- 
lows, sharing a postdoc office next to the office occupied by Henry Ehrenreich in 



February 2, 2008 2;58 Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics archive4 



14 Adventures in Theoretical Physics 

the Applied Physics division. (Henry had recently left General Electric to accept 
a Professorship at Harvard.) In principle I was going to do solid state physics as 
well as particle theory, but that never happened. I spent the fall term working on 
numerical aspects of my weak pion production calculation, and also reading papers 
on attempts to calculate the axial- vector renormalization constant qa-, including the 
papers of Gell-Mann and Levy (1960) and Bernstein, Gell-Mann and Michel (1960). 
I had a hunch that the fact that qa is near one was somehow connected with PCAC, 
but I did not see a concrete way of exploiting PCAC in a calculation. I also was 
starting to think about how to make the PCAC consistency condition calculations 
independent of the cumbersome Lorentz invariant amplitude apparatus that I had 
used to get them. I soon found that the relevant terms could be isolated directly from 
the Feynman diagrams without invoking all the formal kinematic apparatus of my 
thesis, and this approach extended to a general matrix element as well; the strategy 
was the same one that I had used in the paper on CVC and PCAC tests, of going 
from a particular observation in the context of my weak pion production calculation 
to something more general. The result was a formula for soft pion production, in 
terms of external line insertions on the hadronic amplitude for the same process 
in the absence of the pion (Adler, 1965b, R6). For baryons of nonzero isospin, the 
insertion factors are nonzero, while for isospin zero baryons, and mesons such as the 
pion or kaon, the insertion factor vanishes. This latter result generalizes the soft-pion 
zero or "Adler zero" to the emission of a soft pion in any reaction involving only 
incoming and outgoing mesons, but no external baryons. These zeros continue to 
play a role in the analysis of experimental results on mesonic resonances; for recent 
discussions, see Bugg (2003, 2004) and Rupp, Klecfeld, and van Beveren (2004). 

The soft pion zeros are an indication that according to PCAC, the pion cou- 
pling to other hadrons is effectively pseudovector, and not pseudoscalar. When I 
visited CERN in late January of 1965, while in the midst of work on the Feynman 
diagram approach to the PCAC consistency conditions, I found that Veltman had 
been thinking in a similar direction, but had not reached the point of writing down 
external line insertion rules. Veltman gave me a one page memo to file that he had 
written, which pointed out that my PCAC consistency conditions are equivalent to 
pseudovector coupling, which implies the vanishing of invariant amplitudes for soft 
pion emission after singular terms are split off. Veltman also noted that Feynman 
had briefly remarked on the relation between the Goldberger-Treiman relation and 
pseudovector coupling in his conference summary talk at Aix-en-Provence in 1962, 
and gave me a copy of the relevant page. Feynman did not, however, report agree- 
ment with experiments on pion-nucleon scattering, apparently because he did not 
recognize the necessity of splitting off the singular Born terms before concluding 
that pion emission amplitudes vanish in the soft pion limit. 

In the course of my work on the insertion rules I remembered the paper of Nambu 
and Shrauner (1962) which I had briefly mentioned in the Introduction to my thesis; 



February 2, 2008 2;58 Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics archive4 



High Energy Neutrino Reactions, PCAC Relations, and Sum Rules 15 

I now looked this up, as well as the paper of Nambu and Luric (1962) on which it 
was based, and saw that my final formula, when specialized to the case of an ingoing 
and outgoing nucleon line, was substantially the same as the pion bremsstrahlung 
formula of Nambu and Lurie. I noted this in my paper, and consistently referred to 
the Nambu papers from this point on. In recognition of Nambu's work, I used his 
notation x stnd term "chirality" to refer to the integrated axial- vector charge in my 
next two papers, which dealt with the qa sum rule; however, in modern terms this 
is a misnomer, since chirality is now used to mean the left- or right-handed sums of 
vector and axial-vector charges. Gell-Mann's notation for the axial-vector charges 
has become the standard one, and after these two papers I followed the Gell-Mann 
notation. 

The comparison with Nambu's approach also raised the issue of the role of the 
pion mass: do the PCAC results limit smoothly to the zero pion mass ones, for which 
the soft pion theorem derivations appear quite different? This point was dealt with in 
footnote 6 of my paper R6, where I showed that the limits, (1) pion mass approaches 
zero, and (2) pion four momentum transfer squared approaches zero, can be taken 
in either order; the same soft pion theorem results, although the contribution which 
comes from the massless pion pole when the limit (1) is taken first, comes instead 
from the axial-vector divergence when the limit (2) is taken first. This point is now 
taken for granted, but in the early years it caused me (and others) considerable 
confusion. After this paper I almost immediately got involved with sum rules, and 
so I did not publish the detailed connection between my second PCAC paper and 
the Nambu-Lurie approach until a few years later, when I included it as "Appendix 
A" of Chapter 2 of the book on Current Algebras which I put together with Roger 
Dashen (Adler and Dashen, 1968). This appendix is reprinted here as R7. At the 
end of Appendix A, I again discussed the relationship between the zero pion mass 
and nonzero pion mass calculations. The analysis of Appendix A also shows how the 
PCAC approach to soft pion theorems that I had developed fixes the undetermined 
renormalization constant appearing in the chirality approach of Nambu-Lurie. In 
the formulas of Appendix A, there are factors of gA that are missing in the formulas 
of the papers of Nambu, Lurie, and Shrauner. Correspondingly, in the paper of 
Nambu and Lurie, in the discussion associated with their Eq. (2.7), they noted that 
a renormalization constant Z appears, but didn't observe that this can be precisely 
identified as gA- Instead, they redefined their chirality as Z~^Xj that is as {gA)~^X- 
They then made a compensating adjustment in the pion decay constant in their Eq. 
(4.5), where they dropped the gA factor which appears in the Goldberger-Treiman 
relation. Nambu and Lurie say there, "1/A is more or less the conventional pion 
coupling constant 1/A = / = g/2m. (4.5) It is not proven, however, that this agrees 
with the coupling constant defined in the dispersion theory. For the time being, we 
assume it to be the case." In the subsequent paper of Nambu and Shrauner (1962), 
an identification of / with the standard pion-nucleon coupling was established, but 
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the issue of where to include factors of qa was not addressed. My impression from 
this was that there was some uncertainty in the minds of Nambu and his students 
about how the chirahty is to be normahzed, and this impression was reinforced 
by a conversation I later had with Nambu about their work and my Appendix A 
derivation of their result. 

In the low energy theorem for one soft pion which Nambu and Lurie had derived, 
and which I had obtained from PCAC and the Feynman rules in my second PCAC 
paper, the gA factor drops out, and so the normalization of the axial-vector charge or 
"chirality" is irrelevant. The applications discussed in the papers of Nambu, Lurie, 
and Shrauner all involved only one soft pion; Nambu and Lurie looked, for example, 
at TT + N^ir + N + n, with the final pion soft but with the other pions "hard" ; in 
fact, what they actually did was to calculate single soft pion emission in the reaction 
TT + N ^ A(1232). Similarly, Nambu and Shrauner (1962) analyzed single soft pion 
electroproduction and weak production, relating them to the form factors of the 
vector and axial-vector currents. In this paper they included current commutator 
terms by analogy with the classic Low (1958) paper on bremsstrahlung; their answer 
for electroproduction is correct because the gA factor drops out there anyway, but 
their answer for weak axial- vector production lacks gA factors in places, for reasons 
explained in the next paragraph. A follow-up paper of Shrauner (1963) dealt with 
single soft pion production in pion-nucleon scattering, with the scattering pions 
"hard" . My "PCAC consistency condition" was likewise a single soft pion theorem 
which gives a relation between the amplitude fov tt + N ^ N + tt, with the final 
pion soft, and the amplitude vr + A'^ N, which is just the pion-nucleon coupling 
constant, and involves no factors of qa- 

The factors of gA and the explicit identification of the "chirality"' with the charge 
associated with the axial-vector current become important, however, if one wants to 
discuss multiple soft pion production, and also weak axial-vector pion production, 
since one then encounters commutators of an axial- vector charge with an axial- vector 
charge or current, which are evaluated by the Gell-Mann current algebra. If one 
defines the relevant chirality as (^a)"^ times the axial- vector charge, as is implicit 
in the Nambu-Lurie paper when one identifies their Z with gA, then the relevant 
commutator is (^a)"^ times a vector charge, which at zero momentum transfer just 
gives (y^)"^- This is in fact the origin of the (y^)"^ term in the gA sum rule, where 
the difference between {gA)~^ and 1 is highly significant. The point, then, is that 
while Nambu and Lurie gave a correct formula for single soft pion production, it in 
fact cannot be generalized to multiple soft pion production (or soft pion production 
by the weak axial-vector current) without first dealing carefully with the question 
of normalization, as I did in my second PCAC paper R6 and in Appendix A of the 
book on current algebras R7. 

Another difference between the work of Nambu and his students, and what I 
did in my first PCAC consistency condition paper R5, related to the method of 
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comparison with experiment, and the level of accuracy claimed for soft pion pre- 
dictions. The Goldberger-Treiman relation is good to about 7% accuracy, and my 
comparison of the PCAC consistency condition with experiment also indicated that 
the relation was satisfied to about 10%, thus reinforcing the idea that PCAC could 
be used as a quantitative tool for studying the strong interactions, with the residual 
errors arising from the extrapolation of the pion four-momentum squared k"^ from 
to 0. Given that the pion mass is much smaller than all other hadron masses, an 
extrapolation error ~ /^hadron — ^-^ reasonable. The success of the qa sum 
rule shortly afterwards gave further support to the idea that PCAC gives quanti- 
tatively accurate predictions. Nambu, Lurie, and Shrauner, however, argued only 
for qualitative agreement between their soft pion results and experiment based on 
comparisons of rescaled angular distributions, but did not find anything close to 
~ 10% agreement for absolute cross sections. For example, for the relation between 
the cross sections for pion-nucleon scattering with production of an additional pion, 
and pion-nucleon scattering, Nambu and Lurie (1962) showed agreement with their 
predictions to within roughly a factor of three (giving a predicted cross section of 
0.2 mb versus experimental values in the range 0.6 to 0.7 mb). Similarly, for the 
same reaction Shrauner (1963) found that "the magnitudes of the cross sections 
seem to be significantly underestimated by a factor of about 7" . The source of these 
discrepancies is not clear. They may be due, in part, to the fact that, instead of 
testing the soft pion predictions at the kinematic point of zero pion four momentum 
(such as the point u = vb = used in my PCAC consistency condition work), 
Nambu, Lurie, and Shrauner did the comparisons in energy intervals above scat- 
tering threshold. (However, Shrauner argues, on the basis of branching ratios, that 
the discrepancy is probably not attributable to an overlap of the A(1232) resonance 
with the comparison region.) I think that a combination of lack of clarity about how 
their chiral current was related to the physical axial- vector current, as reflected in 
the normalization problems noted above, together with the lack of striking quanti- 
tative comparisons with experiment, were responsible for the work of Nambu and 
his students being largely unnoticed by the community. It was only after the quan- 
titative successes of PCAC in my consistency condition paper and in the gA sum 
rule that followed shortly afterwards, and my demonstration of the equivalence be- 
tween the PCAC insertion rules and the chirality conservation approach, that the 
significance of the work of the Nambu group became clear. 

Finally, as an historical footnote to this discussion of soft pion theorems, Tou- 
schek (1957) appears to have been the first to introduce continuous 75 symmetry 
transformations, as applied to the neutrino field, and to observe that invariance un- 
der these transformations requires that the neutrino mass be zero. Nishijima (1959) 
(in work submitted for publication in late 1958) considered continuous 75 symmetry 
transformations in theories of massive fermions; to preserve 75 invariance he gauged 
the transformations with a massless pseudoscalar boson, transforming as B ^ B + \ 
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under a 75 transformation with parameter A. The action written in Nishijima's pa- 
per is just the effective action one would now write for a singlet Nambu-Goldstone 
boson (such as an axion) coupled to a massive fermion. Nambu (1959), in remarks 
at the Kiev Conference, noted the analogy between 75 symmetry in particle physics 
and gauge invariance in superconductivity, and related this to his suggestion that 
a nucleon-antinucleon pair in a pseudoscalar state could be the pion. This idea was 
further developed in the well-known paper Nambu and Jona-Lasinio (1961), that 
laid the basis for the modern theory of Nambu-Goldstone bosons associated with 
spontaneous symmetry breaking, and for the fact that most of the mass of the nu- 
cleon comes from chiral symmetry breaking. In the meantime, Giirsey (1960) had 
introduced isovector 75 transformations, as an extension of the similar isoscalar 
transformations used by Nishijima, and constructed a precursor to nonlinear pion 
effective Lagrangians. These papers all contained important seeds of our present-day 
understanding of chiral symmetries. 



Sum Rules 

I have now gotten ahead of the chronological story; a lot of things happened very 
fast in 1965. In the fall of 1964 I started thinking about the question of the renor- 
nialization of the nuclcon axial- vector coupling qa- and accumulated a file of papers 
on the subject. However, my attempts at a calculation were based on the commu- 
tator of the nucleon field with the weak axial-vector charge, giving results identical 
to those already obtained by Bernstein, Gell-Mann, and Michel (1960), which ex- 
pressed gA in terms of unmeasurable off-shell form factors, but achieving no further 
progress. In early 1965 I saw a preprint of Fubini and Furlan (published as Fubini 
and Furlan, 1965) which applied the commutator of vector current charges, together 
with the ingenious idea of going to an infinite momentum frame, to calculate the 
radiatively induced renormalization of the vector current. (Harvard did not have a 
preprint library in those days, but Schwinger's secretary Shirley would let me into 
his office from time to time to look through the unread preprints that were stacked 
on his desk. This presented no difficulty since Schwinger was a night-owl who mainly 
worked at home, and used his office only a few hours a week, when he came in to 
lecture and to see students. That is how I became aware of the Fubini-Furlan pa- 
per. As a result of this experience, one of the first things I did when I arrived at 
the Institute for Advanced Study eighteen months later was to start a preprint li- 
brary for the particle physicists.) I immediately thought about applying this to the 
axial-vector current, using the Gell-Mann current algebra that I'd seen the previous 
summer at Bell Labs. However, because of other things I was working on I didn't get 
around to it until a few months later, when in a chance encounter Arthur Jaffe told 
me that he had heard a talk by Roger Dashen about work he and Gell-Mann had 
been doing on sum rules. I decided I had better stop delaying (although it turned 
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out that Dashen and Gell-Mann were working on fixed momentum transfer sum 
rules), dropped my weak pion production computer work, and spent spring break 
working out the consequences of combining the Gell-Mann current algebra, PC AC, 
and the Fubini-Furlan method. It turned out to be surprisingly easy, with the infi- 
nite momentum frame solving a problem I had encountered in earlier attempts to 
calculate g^, which is that the axial- vector charge matrix element is proportional to 
the nucleon velocity, and vanishes for nucleons at rest. I soon had a formula relating 
the difference between 1 and {gA)~^ to a convergent integral over a difference of 
pion-nucleon cross sections. 



with and Mn the pion and nucleon masses, a^^W) the total cross section for 
scattering of a zero-mass vr^ on a proton at center-of-mass energy W, and again with 
K^^'^{0) the pionic form factor of the nucleon, normalized so that K^^'^ {—M"^) = 
1. I first tried to saturate the integral in the narrow A(1232) approximation, and 
the result was a disappointing = 3. I then pulled out the computer deck I 
had used for the consistency condition numerical work the previous year, did the 
integral carefully, and got qa = 1.24. I also observed that the relation for qa could 
be equivalently recast as a two-soft pion low energy theorem, 



u and VB are again the energy and momentum transfer variables, and MV are 
the initial and final pion masses, which are now both off shell. A few days after I 
submitted a letter to Physical Review Letters^ Sidney Coleman returned from a trip 
to SLAC and when I described my results to him, he told me that he had just heard 
about a similar calculation being done there by Bill Weisberger, whose points of 
departure were the same as mine: the Gell-Mann current algebra, the Fubini-Furlan 
paper, and my paper on PCAC consistency conditions. I talked to Weisberger by 
phone, and then called PRL and asked them to delay publication of my letter until 
they received the manuscript Weisberger was preparing. My paper (Adler, 1965c, 
R8) and Weisberger's (Weisberger, 1965) appeared as back-to-back letters in the 
June 21 issue. They give substantially identical derivations; Weisberger's numerical 
result of 1.16 differed from mine of 1.24 because I had included a correction for the 
off-pion-mass-shell extrapolation of the threshold phase space factor associated with 
the A(1232) resonance, which I knew from my work on weak pion production could 
be reliably estimated. At the time, this correction made agreement with experiment 
worse (the experimental value for qa was then 1.18), but the best value now has 
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settled down to qa = 1.257 it .003, in gratifyingly good agreement with the value I 
got when I included the kinematic extrapolation correction. Weisberger and I both 
submitted longer papers to Physical Review describing our work (Adler, 1965d, 
R9); Weisberger (1966). These emphasized the low energy theorem approach to 
the relation for 5^, giving historically the first two-soft pion low energy theorem. 
In my paper I also gave an analog for pion-pion scattering, and then in the final 
section (Adler, 1965, R9, Section V), I returned to the observation that I had made 
a year earlier about forward lepton scattering, and showed that the qa sum rule 
could be converted to an exact relation, involving no off-shell PCAC extrapolation, 
for forward inelastic high energy neutrino reactions. This relation, which provided 
a test of the Gell-Mann current algebra of axial-vector charge commutators, was 
another indication of a deep connection between the structure of currents on the 
one hand, and inelastic lepton scattering on the other. 

The QA sum rule provided yet a third result supporting the use of PCAC as a 
method for calculating soft pion processes. Simultaneously, it was a stunning success 
for Gell-Mann's brilliant idea of abstracting the current algebra from the naive quark 
model, with the hope that it would prove to be a feature that would also be valid 
in the then unknown theory of the strong interactions. At this point the whole 
community took notice, and a string of current algcbra/PCAC applications appeared 
in rapid succession. To mention just a few, Weinberg (1966a) and Tomozawa (1966) 
reexpressed the soft pion theorems for pion-nucleon scattering, coming from my 
consistency condition papers and the qa sum rule papers, in the form of formulas 
for the pion-nucleon scattering lengths, and Weinberg in the same paper also used my 
result of a PCAC zero in pion-pion scattering, plus a symmetry argument, as inputs 
for a derivation of pion-pion scattering lengths. Weinberg (1966b) also generalized 
the two-soft pion low energy form of the qa sum rule to a general formula for multiple 
soft pion production. Finally, in another striking application of soft pion theorems, 
Callan and Treiman (1966) gave a series of important results for K meson decays, 
in which the role of rapidly varying pole terms was clarified in Weinberg (1966c). 

In connection with the qa sum rule, I have an interesting Feynman anecdote 
to relate. I spent the spring term of 1966 as a member of Murray Gell-Mann's 
postdoctoral group at Cal Tech. A few weeks after I arrived, Feynman asked me to 
stop by his office to look at some pages in his notebook, in which he had almost 
derived the qa sum rule, before Weisberger and I did it. The whole expression was 
there (including the kinematic correction that I had included for the off-mass-shell 
extrapolation), except that, where the Gell-Mann algebra had dictated a 1 coming 
from the commutator of two axial-vector charges giving an unrenormalized vector 
charge, Feynman had put 0! So numerically the relation did not work, and Feynman 
had given up on it and gone on to other things. He evidently had not paid attention 
to Gell-Mann's current algebra, or at least not realized, from his heuristic way of 
doing things, that it was essential for this calculation. 



February 2, 2008 2;58 Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics archive4 



High Energy Neutrino Reactions, PCAC Relations, and Sum Rules 21 

Returning again to events in 1965, as soon as the long paper on gA_ was com- 
pleted, I departed to be a summer visitor at CERN. There I met Murray Gell-Mann 
for the first time, and had long conversations with him. Murray was particularly 
interested in the Section V relation between the current algebra of vector and axial- 
vector charges and forward high energy neutrino reactions, and urged me to try to 
extend it to a test of the local current algebra which he had given in his Physics 
paper (Gell-Mann, 1964). I spent the summer working on this, and found that I 
could do it; as I recall, the crucial bits came together when I spent a day work- 
ing at a kitchen table during a week off for holiday at Lake Garda. The results 
were written up in the late summer of 1965 at CERN and/or Harvard, and ap- 
peared in Adler (1966), RIO. This article gave the first detailed working out of the 
structure of deep inelastic high energy neutrino scattering (the electroproduction 
case was given independently in the review of de Forest and Walecka (1966)) , with 
both the electroproduction and neutrino cases specific examples of general local 
lepton coupling theorems given by Lee and Yang and by Pais, as referenced in my 
article RIO. However, the a, (3,^ notation that I used for the structure functions 
did not become the standard one; the now standard Wi,2,3 structure functions, 
which follow the notation of de Forest and Walecka and were further popularized 
by Bjorkcn, are linearly related to the ones I used. [Specifically, I separated the 
cross section into strangeness-conserving and strangeness-changing pieces, whereas 
the current convention is to define the structure functions as the sum of both. At 
zero Cabibbo angle, the relation between my a, /?, 7 and the conventional W"i,2,3 is 
a = Wi, (3 = W2, 2Mjv7 = W3, with M^r the nucleon mass. For general Cabibbo 
angle 6c, one has cos^ ^?c'/5^^q + sin^ 6c f3^^g^}_-^ = VF^''^, with similar relations for 
the other two structure functions.] The article actually gave three sum rules; two for 
the a and 7 structure functions which subsequent analysis by Dashen showed to be 
divergent and hence useless, and one for the /? deep inelastic amplitude which is a 
convergent and useful relation. The beta sum rule divides into axial-vector and vec- 
tor parts, which are separately given as Eqs. (53a) and (53b) respectively of Adler 
(1966), RIO, and which when added to give the total AS* = cross section yield 

2 = 9A{qr+Fr{Qr + Q'Fr{Qr+ r ^dW\p(-\q^W)-/3(+\q\W)] . 

This sum rule (and the ones for the separate vector and axial- vector contributions) 
has the notable feature that the left-hand side is independent of g^, even though the 
Born term contributions and the continuum integrand on the right are g^-dependent. 
At zero squared momentum transfer q^, the axial- vector part of the [5 sum rule 
reduces to the relation I gave in my long paper on qa, which had prompted Gell- 
Mann's question about a generalization; the first derivative of the vector part with 
respect to at = gives the sum rule also derived by Cabibbo and Radicati 
(1966) using moments of currents. Because the neutrino and antineutrino differential 
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cross sections (Pa/d{q'^)dWare dominated by the (5 structure function in the hmit 
of large neutrino energy, by integrating over W one gets the limiting cross section 
relation (at zero Cabibbo angle) 



lim 

Ev^oo 



da{u + p) da{u + p) 



91 

TT 



d{q^) d{q^) 

with G the Fermi constant. Similar relations at non-zero Cabibbo angle are given 
in Eq. (27) of RIO, and it is easy to obtain analogous relations for the vector and 
axial-vector contributions to the cross sections taken separately. 

In late October of 1965 I spoke on "High Energy Semileptonic Reactions" at the 
International Conference on Weak Interactions held at Argonne National Labora- 
tory (Adler, 1965e), in which I gave the first public presentation of the local current 
algebra sum rules for the (3 deep inelastic neutrino structure functions, and the lim- 
iting relations for the differential cross sections that they imply. In the published 
discussion following this talk, in answer to a question by Fubini, I noted that the (3 
sum rule had been rederived by Callan (unpublished) using the infinite momentum 
frame limiting method, but that the a and 7 sum rules could not be derived this 
way, reinforcing suspicions that "the integral for [3 is convergent, while the other 
two relations (for a and 7) really need subtractions." Bjorken was in the audience 
and was intrigued by the j3 sum rule results, and soon afterwards converted them 
into a differential cross section inequality (Bjorken, 1966, 1967) for deep inelastic 
electron-nucleon scattering, for which there was the prospect of experimental tests 
relatively soon. To see why the neutrino cross section relation given above implies 
an inequality for electron scattering, one notes that since the v + p differential cross 
section is positive, the right-hand side G^/tt gives a lower bound for the V + p dif- 
ferential cross section, with a similar lower bound holding for the vector current 
contribution alone. But noting that according to CVC, the vector weak current is 
in the same isospin multiplet as the isovector part of the electromagnetic current, 
and using the Wigner-Eckart theorem, one gets a corresponding lower bound for the 
inelastic differential cross section induced by an isovector virtual photon scattering 
on a nucleon. One then notes that in the scattering of a virtual photon on a target 
containing equal numbers of neutrons and protons, the isovector and isoscalar cur- 
rents add incoherently, and so the isovector current contribution alone gives a lower 
bound. Combining the two bounds, and including an extra 1/(A;^)^ for the virtual 
photon propagator, replacing G by the fine structure constant a, and keeping track 
of numerical factors, one gets Bjorken's electron scattering result 

d[cT{e + p) + a{e + n)] 2^a^ 

d(fe2) ^ (k2)2 ' 

which was testable in the experiments soon to begin at SLAC. Verification of my 
neutrino sum rule, on the other hand, took two decades and more; see AUasia et 
al. (1985) for the first reported test, and Conrad, Shaevitz, and Bolton (1998) for 
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more recent high precision results. For a recent study of my neutrino sum rule, in 
comparison with the Gottfried (1967) sum rule for electron-proton scattering, within 
the framework of the large Nc expansion of QCD with Nc colors, see Broadhurst, 
Kataev, and Maxwell (2004) and Kataev (2004). 

Although not directly tested until many years after it was derived in 1965, my 
neutrino sum rule had important conceptual implications that figured prominently 
in developments over the next few years. To begin with, it gave the first indica- 
tions that deep inelastic lepton scattering would give information about the local 
properties of currents, a fact that at first seemed astonishing, but which turned out 
to have important extensions. Secondly, as noted by Chew in remarks at the 1967 
Solvay Conference (Solvay, 1968), the closure property tested in the sum rules, if 
verified experimentally, would suggest the presence of elementary constituents in- 
side hadrons. In a Letter (Chew, 1967) published shortly after this conference. Chew 
argued that my sum rule, if verified, would rule out the then popular "bootstrap" 
models of hadrons, in which all strongly interacting particles were asserted to be 
equivalent ("nuclear democracy"). In his words, "such sum rules may allow con- 
frontation between an underlying local spacetime structure for strong interactions 
and a true bootstrap. The pure bootstrap idea, we suggest, may be incompatible 
with closure." In a similar vein, Bjorkcn, in his 1967 Varenna lectures (Bjorkcn, 
1968), argued that the neutrino sum rule was strongly suggestive of the presence 
of hadronic constituents, and this was also noted in the review of Llewellyn Smith 
(1972). 

These conceptual developments still left undetermined the mechanism by which 
the neutrino sum rule, and Bjorkcn's electron scattering inequality, could be sat- 
urated at large q^. During my visit to Cal Tech in 1966, I renewed my graduate 
school acquaintance with Fred Oilman and worked with him on two projects. One 
was an analysis of the saturation of the neutrino sum rule for small q'^ (Adlcr and 
Oilman, 1967, Rll), in which we concluded that SLAC (soon to start operating) 
would have enough energy to confront the saturation of the nonzero q^ sum rules 
in a meaningful way. In this paper, we noted that the (3 sum rule posed what at 
the time was a puzzle: the left-hand side of the sum rule is a constant, while the 
Born terms on the right are squares of nucleon form factors, which vanish rapidly as 
the momentum transfer q^ becomes large. The low lying nucleon resonance contri- 
butions on the right were expected to behave like the A(1232) contribution, which 
is form factor dominated and also falls off rapidly with q^. Hence it was clear that 
something new and interesting must happen in the deep inelastic region if the sum 
rule were to be satisfied for large q"^: "to maintain a constant sum at large g^, the 
high W states, which require a large E to be excited, must make a much more 
important contribution to the sum rules than they do at q^ = 0". We were cau- 
tious, however (too cautious, as it turned out!), and did not attempt to model the 
structure of the deep inelastic component needed to saturate the sum rule at large 
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q^. Bjorken became interested in the issue of how the sum rule could be saturated, 
and formulated several preliminary models that (in retrospect) already had hints of 
the dominance of a regime where the energy transfer v grows proportionately to the 
value of g^. I summarized these pre-scaling proposals of Bjorken in the discussion 
period of the 1967 Solvay Conference (Solvay, 1968), which Bjorken did not attend, 
in response to questions from Chew and others as to how the neutrino sum rule 
could be saturated. The precise saturation mechanism was clarified (to a very good 
first approximation) some months after the Solvay conference with the proposal by 
Bjorken (Bjorken, 1969) of scaling, and soon afterwards, with the experimental work 
at SLAC on deep inelastic electron scattering, that confirmed Bjorken's intuition. 
For a very clear exposition of the relation between scaling and the neutrino sum 
rule, see Sec. 3.6B of Llewellyn Smith (1972), who notes that when the sum rule is 
rewritten in terms of Bjorken's scaling variable w, "The simplest way to ensure the 
[my q^] independence of the left-hand side as ^ oo is to assume that the 
limit in eq.(3.71) [ in my notation, hmQ2_>QQ j^g^ed 

^^^\uj,Q'^/Ml)] exists". 



More Low Energy Theorems; Weak Pion Production Redux 

In the fall of 1965 I received an invitation from Oppenheimcr, which I accepted, to 
come to the Institute for Advanced Study as a long term member with a five year 
appointment, starting in the fall of 1966. Roger Dashen, whom I had met briefiy 
when he visited Harvard earlier in 1965, received a similar invitation. The intent 
behind our appointments was that we would reinvigorate high energy theory at the 
Institute, which had fallen into a decline with the departures of Lee, Yang, and Pais 
to professorships elsewhere, and with a turn of Dyson's research interests towards 
astrophysics. 

Before going to Princeton, as mentioned above, I spent the spring term of 1966 
as a postdoc in Murray Gell-Mann's group at Cal Tech. By this time the successes of 
PCAC and current algebra had attracted a lot of attention and stimulated an out- 
pouring of papers, the more important ones of which appear in the volume which 
Dashen and I put together a year later. My own work in the spring of 1966 was 
focused on two issues. The first involved using PCAC to get small momentum ex- 
pansions of matrix elements of the axial-vector current, in analogy with the paper of 
Low (1958) on soft photon bremsstrahlung. With Joe Dothan, I wrote a long paper 
(Adler and Dothan, 1966, R12) applying these ideas to the weak pion production 
amplitude and to radiative muon capture. The weak pion results figured in my later 
comprehensive paper on the subject (see below), while the radiative muon capture 
work was incorporated into later chiral perturbation theory treatments of radiative 
muon capture; for a review of the current theoretical and experimental status of 
muon capture, including a discussion of discrepancies between theory and exper- 
iment in the radiative capture case, see Gorringe and Fearing (2004). The other 
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direction of work involved two phenomenological studies done with Fred Oilman. 
One of these dealt with saturation of the neutrino sum rule, as described in the pre- 
ceding section. The other dealt with a detailed phenomenological study of the PCAC 
predictions for pion photo- and electro-production (Adler and Oilman, 1966, R13), 
including a saturation analysis for the Fubini-Furlan-Rossetti (1965) sum rule; for 
a recent update on this, see Pasquini, Drechsel, and Tiator (2005). 

My first year at the Institute was largely devoted to writing the book on Current 
Algebras with Roger Dashen (Adler and Dashen, 1968). The book consisted of se- 
lected reprints grouped by categories with commentaries that we supplied, plus some 
general introductory material. I was responsible for writing the introductory sections 
and the commentaries for Chapters 1-3, which included Appendix A, reprinted here 
as R7. Roger was responsible for the commentaries for Chapters 4-7, which included 
an original and very detailed analysis of precisely which sum rules could be derived 
by the infinite momentum frame method, or in different language, when a naive 
assumption of unsubtracted dispersion relations would (and would not) give correct 
results. This analysis confirmed earlier suspicions that my /? neutrino sum rule was 
correct, but that the a and 7 sum rules should have subtractions, and so were not 
useful. The book on Current Algebras was completed, and sent off to the publisher, 
in the fall of 1967. 

During this period I also worked with Bill Weisberger, who was then at Prince- 
ton, on sorting out the tricky pion pole structure in two pion photo- and electro- 
production, which had to be handled carefully to get a fully gauge-invariant ex- 
pression (Adler and Weisberger, 1968, R14). Our interest in this process, as noted 
in the title of the paper, was motivated by the fact that it gives an alterna- 
tive, indirect method of measuring the nucleon axial- vector form factor gA{k^)- 
An experiment to measure gA{k'^) by this method was carried out by Joos et al. 
(1976) giving a value = 1.18 ib 0.07 OcV for the mass in the dipole formula 
gA^k"^) = 5'yl(0)(l + A;^/m^)^^. This value is in good agreement with the value 
ruA = 1-07 zb 0.06 OeV given in the quasielastic scattering + n iJi~ + p exper- 
iment of Baker et al. (1981), and also in reasonable agreement with values of rriA 
obtained from single pion electroproduction at threshold using the low energy the- 
orem of Nambu and Shrauner (1962) (for which experimental references are given 
in both the Joos et al. and Baker et al. articles) . At the 1968 Nobel Symposium 
on Elementary Particle Theory, I gave a brief talk (Adler, 1968a) reviewing various 
methods that had been proposed to measure the nucleon axial-vector form factor: 
quasielastic neutrino scattering, neutrino production of the A(1232), electroproduc- 
tion of a single soft pion (the Nambu-Shrauner proposal), and electroproduction 
of the A(1232) plus an additional soft pion (the proposal of my paper R14 with 
Weisberger). Over the years since then, all of these methods have been carried out. 

I also returned, after completion of the book on Current Algebras, to the repeat- 
edly delayed project of completing the numerical work associated with my thesis 
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calculation of weak pion production, and this kept me busy until the spring of 1968, 
when I finished a comprehensive article on photo-, electro-, and weak single-pion 
production in the A(1232), or as it was then termed, the (3,3) resonance region 
(Adler, 1968b, R15). This paper is so long (123 pages) that it is not feasible to 
reprint it all here, so I have included only the introduction (Sec. 1) and part of the 
discussion of implications of PCAC (Sees. 5A and 5B). The basic approximation used 
in this paper consisted of using the Born approximation for all nonresonant multi- 
poles, augmented by terms coming from the PCAC low energy theorems, together 
with a unitarized Born approximation for the dominant resonant (3,3) multipoles, 
giving predictions for weak pion production in the (3,3) region in terms of the vec- 
tor and axial-vector form factors of the nucleon. By 1968 there were experimental 
results on pion electroproduction which were in satisfactory agreement with my the- 
ory, except for values of the momentum transfer k'^ significantly larger than roughly 
0.6(GeV/c)^, where in retrospect one can see effects from the scaling regime showing 
up. For neutrino pion production, preliminary comparison of my results with CERN 
data showed an axial- vector form factor gA{k'^) that falls off more slowly with fe^ than 
the vector form factors, with a dipole mass of ruA ~ 1.2GeV. A subsequent compar- 
ison of my model with high-statistics neutrino data from Brookhavcn by Kitagaki 
et al. (1986) gave good fits with a dipole mass of uia = 1-28 ± 0.11 GcV, somewhat 
high compared to values obtained by other methods described above. Reasonable 
fits of my model to the A cross section and density matrix elements measured in 
the hydrogen bubble chamber at Argonne were also reported in papers of Schreiner 
and von Hippel (1973a, b), and a comparison with other models and data was given 
by Rein and Schgal (1981). (For a recent alternative approach to A(1232) weak 
production, and extensive references to earlier theoretical and experimental studies 
of this reaction, see Paschos et al. (2004).) After 1968 I did not work again on weak 
pion production until 1974-75, when the subject became important because it was 
an avenue for exploring weak neutral currents, as discussed in Chapter 5 below. 

To conclude this section on low energy theorems, let me address the question of 
the extent to which the modern viewpoint, of pions as Nambu-Goldstone bosons, 
entered into my work. The earliest reference that I could find in my research notes 
to the "Goldstone theorem" (and specifically to the derivations given in the paper 
of Goldstone, Salam, and Weinberg, 1962) dates from the spring of 1967, in other 
words, after nearly all the work on soft pion theorems was completed. (This reference 
was in the context of calculations on the axial-vector vertex in QED that were the 
starting point of my work on the axial anomaly, to be discussed in the next chapter.) 
I fully appreciated the role of pions as Nambu-Goldstone bosons only after hearing 
seminars that referred to Nambu-Goldstone versus Wigner-Weyl representations of 
75 symmetry, which were connected (as best I recall) with the work of Gell-Mann, 
Cakes, and Renner (1968) and Dashen (1969) on chiral SU{?>) x SU{2)) as a strong 
interaction symmetry. This may at first seem surprising, but now that the tapestry 
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of the standard model is completed, we see clearly the interrelations of its many 
threads; at the time when these threads were being laid down, those working from 
one direction were often unaware or only dimly aware of progress from another. 

Perhaps this is also a good point to say that the elucidation of the chiral struc- 
ture of the strong interactions was only one of the results flowing from the successes 
of current algebra methods and PC AC; something that was perhaps even more sig- 
nificant at the time was the demonstration that quantum field theory methods were 
really valid, after all, in dealing with hadronic interactions. When I entered gradu- 
ate school, the prevailing view was that the strong interactions would be understood 
through some kind of dispersion theoretic "reciprocal bootstrap" , and nearly every 
particle physics talk I heard began with a Mandelstam diagram on the blackboard. 
By 1967, this view had changed; it was clear that field theory could produce re- 
sults which could not be obtained from the dispersion relations program, and this 
strongly influenced subsequent developments. 
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3. Anomalies: Chiral Anomalies and Their Nonrenormalization, 
Perturbative Corrections to Scaling, and Trace Anomalies 

to all Orders 



Chiral Anomalies and tt" — ^ 77 Decay 

I got into the subject of anomalies in an indirect way, through exploration during 
1967-1968 of the speculative idea that the muon-electron mass difference could be 
accounted for by giving the muon an additional magnetic monopole electromagnetic 
coupling through an axial-vector current, which somehow was nonperturbatively 
renormalized to zero. After much frtiitless study of the integral equations for the 
axial-vector vertex part, I decided in the spring of 1968 to first try to answer a 
well-defined question, which was whether the axial- vector vertex in QED was renor- 
malized by multiplication by Z2, as I had been implicitly assuming. At the time 
when I turned to this question, I had just started a 6-week visit to the Cavendish 
Laboratory in Cambridge, England after flying to London with my family on April 
21, 1968 (as recorded by my ex- wife Judith in my oldest daughter Jessica's "baby 
book"). In the Cavendish I shared an office with my former adviser Sam Treiman, 
and was enjoying the opportunity to try a new project not requiring extensive com- 
puter analysis; I had only a month before finished my Annals of Physics paper R15 
on weak pion production (see Chapter 2), which had required extensive computa- 
tion, not easy to do in those days when one had to wait hours or even a day for the 
results of a computer run. 

My interest in the multiplicative renormalization question had been piqued by 
work of van Nieuwenhuizen, in which he had attempted to demonstrate the finite- 
ness to all orders of radiative corrections to jj, decay, using an argument based on 
subtraction of renormalization constants that I knew to be incorrect beyond lead- 
ing order. I had learned about this work during the previous summer, when I was 
a lecturer at the Varcnna summer school held by Lake Como from July 17-29, 
1967, at which van Nieuwenhuizen had given a seminar on this topic that was cri- 
tiqued by Bjorken, another lecturer. (For further historical details about this, see my 
review article Adler (2004a) on anomalies and anomaly nonrenormalization, from 
which much of this commentary has been adapted.) Working in the old Cavendish, I 
rather rapidly found an inductive mtiltiplicativc rcnormalizability proof, paralleling 
the one in Bjorken and Drell (1965) for finitencss of Z2 times the vector vertex. 
I prepared a detailed outline for a paper describing the proof, but before writing 
things up, I decided as a check to test whether the formal argument for the closed 
loop part of the Ward identity worked in the case of the smallest loop diagram. This 
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is a triangle diagram with one axial and two vector vertices (the AW triangle; 
see Fig. 1(a)), which because of Furry's theorem (C invariance) has no analog in 
the vector vertex case. I knew from a student seminar that I had attended during 
my graduate study at Princeton that this diagram had been explicitly calculated 
using a gauge-invariant regularization by Rosenberg (1963), who was interested in 
the astrophysical process + v ^ ^ + with 7y a virtual photon emitted by a 
nucleus. I got Rosenberg's paper, tested the Ward identity, and to my astonishment 
(and Treiman's when I told him the result) found that it failed! I soon found that 
the problem was that my formal proof used a shift of integration variables inside 
a linearly divergent integral, which (as I again recalled from student reading) had 
been analyzed in an Appendix to the classic text of Jauch and Rohrlich (1955), with 
a calculable constant remainder. For all closed loop contributions to the axial vertex 
in Abelian electrodynamics with larger numbers of vector vertices (the AVVVV, 
AVVVVVV loops; see Fig. 1(b)), the fermion loop integrals for fixed photon 
momenta are highly convergent and the shift of integration variables needed in the 
Ward identity is valid, so proceeding in this fermion loop-wise fashion there were 
apparently no further additional or "anomalous" contributions to the axial-vector 
Ward identity. With this fact in the back of my mind I was convinced from the outset 
that the anomalous contribution to the axial Ward identity would come just from 
the triangle diagram, with no renormaUzations of the anomaly coefficient arising 
from higher order AVV diagrams with virtual photon insertions. 




V V V V V V V V 

(a) (b) 



Fig. 1. Fermion loop diagam contributions to the axial- vector vertex part. Solid lines are 
fermions, and dashed lines are photons, (a) The smallest loop, the AVV triangle diagram, (b) 
Larger loops with four or more vector vertices, which (when summed over vertex orderings) 
obey normal Ward identities. 

In early June, at the end of my 6 weeks in Cambridge, I returned to the US and 
then went to Aspen, where I spent the summer working out a manuscript on the 
properties of the axial anomaly, which became the body (pages 2426-2434) of the 
final published version (Adler, 1969, R16). Several of the things done there deserve 
mention, since they were important in later applications. The first was a calculation 



February 2, 2008 2;58 Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics archive4 



36 Adventures in Theoretical Physics 

of the field theoretic form of the anomaly, giving the now well-known result 

with = ip^n^^ijj the axial- vector current (referred to above as A), = ipj^ip the 
pseudoscalar current, and with mo and ccq the (unrenormalized) fermion mass and 
coupling constant. The second was a demonstration that because of the anomaly, Z2 
is no longer the multiplicative renormalization constant for the axial-vector vertex, 
as a result of the diagram drawn in Fig. 1(a) in which the AW triangle is joined to 
an electron line with two virtual photons. Instead, the axial- vector vertex is made 
finite by multiplication by the renormalization constant 

Za = Z2[l + liao/nf log(AVm2) + ...] , 

thus giving an answer to the question with which I started my investigation. Thirdly, 
as an application of this result, I showed that the anomaly leads, in fourth or- 
der of perturbation theory, to infinite radiative corrections to the current-current 
theory of t'^/i and ly^e scattering, but that this infinity can be cancelled between 
different fermion species by adding appropriate u^e and v^n scattering terms to 
the Lagrangian. This result is a forerunner of anomaly cancellation mechanisms in 
modern gauge theories. It is related to the fact, also discussed in my paper, that the 
asymptotic behavior of the AVV triangle diagram saturates the bound given by the 
Weinberg power counting rules, rather than being one power better as is the case 
for the AVVVV and higher loop diagrams, and has a leading asymptotic term that 
is a function solely of the external momenta. Finally, I also showed that a gauge 
invariant chiral generator still exists in the presence of the anomaly. Although not 
figuring in our subsequent discussion here, in its non-Abelian generalization this was 
relevant (as reviewed in Coleman, 1989) to later discussions of the U{1) problem 
in quantum chromodynamics (QCD), leading up to the solution given by 't Hooft 
(1976). 

No sooner was this part of my paper completed than Sidney Coleman arrived 
in Aspen from Europe, and told me that Bell and Jackiw (published as Bell and 
Jackiw, 1969) had independently discovered the anomalous behavior of the AVV 
triangle graph, in the context of a sigma model investigation of the Veltman (1967)- 
Suthcrland (1967) theorem stating that -k^ —>■ 77 decay is forbidden in a PCAC 
calculation. The Sutherland- Veltman theorem is a kinematic statement about the 
AVV three-point function, which asserts that if the momenta associated with the 
currents A, V, V are respectively q, ki,k2, then the requirement of gauge invariance 
on the vector currents forces the AVV vertex to be of order qkik2 in the external 
momenta. Hence when one applies a divergence to the axial-vector vertex and uses 
the standard PCAC relation (with the quark current J^^^ the analog of ^j^ ) 
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with the pion mass, (pT^ the pion field, and 7,^ the charged pion decay constant, 
one finds that the 7r° — > 77 matrix element is of order q^kik2, and hence vanishes 
in the soft pion limit 0. Bell and Jackiw analyzed this result by a perturbative 
calculation in the cr-model, in which PCAC is formally built in from the outset, and 
found a non-vanishing result for the tt^ 77 amplitude, which they traced back 
to the fact that the regularized AW triangle diagram cannot be defined to satisfy 
the requirements of both PCAC and gauge invariance. This constituted the "PCAC 
Puzzle" referred to in the title of their paper. They then proposed to modify the 
original a-model by adding further regulator fields with mass-dependent coupling 
constants in such a manner as to simultaneously enforce gauge invariance and PCAC, 
thus enforcing the Sut her land- Velt man prediction of a vanishing tt^ 77 decay 
amplitude. In the words of Bell and Jackiw in their paper, "It has to be insisted that 
the introduction of this mass dependence of coupling constants is not an arbitrary 
step in the PCAC context. If a regularization is introduced to define the theory, 
it must respect any formal properties which are to be appealed to." And again in 
concluding their paper, they stated "To the complaint that we have changed the 
theory, we answer that only the revised version embodies simultaneously the ideas 
of PCAC and gauge invariance." 

It was immediately clear to me, in the course of the conversation with Sidney 
Coleman, that introducing additional regulators to eliminate the anomaly would 
entail renormalizability problems in a meson scattering, and was not the correct way 
to proceed. However, it was also clear that Bell and Jackiw had made an important 
observation in tying the anomaly to the Sutherland-Vcltman theorem for tt*^ 77 
decay, and that I could use the sigma-model version of the anomaly equation to get 
a nonzero prediction for the 7r° — >■ 77 amplitude, with the whole decay amplitude 
arising from the anomaly term. I then wrote an Appendix to my paper (pages 2434- 
2438), clearly delineated from the manuscript that I had finished before Sidney's 
arrival, in which I gave a detailed rebuttal of the regulator construction, by showing 
that the anomaly could not be eliminated without spoiling either gauge-invariance 
or renormalizability. (in later discussions I added unitarity to this list, to exclude 
the possibility of canceling the anomaly by adding a term to the axial current with 
a d^/ {dxY singularity.) In this Appendix I also used an anomaly-modified PCAC 
equation 

with S a constant determined by the constituent fermion charges and axial-vector 
couplings, to obtain a PCAC formula for the 7r° 77 amplitude F'^ 

= -iaM2SV2/U . 

Although the axial anomaly, in the context of breakdown of the "pseudoscalar- 
pseudovector equivalence theorem", had in fact been observed much earlier, start- 
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ing with Fukuda and Miyamoto (1949) and Steinberger (1949) and continuing to 
Schwinger (1951), my paper broke new ground by treating the anomaly neither as 
a baffling calculational result, nor as a field theoretic artifact to be eliminated by 
a suitable regularization scheme, but instead as a real physical effect (breaking of 
classical symmetries by the quantization process) with observable physical conse- 
quences. 

This point of view was not immediately embraced by everyone else. After com- 
pleting my Appendix I sent Bell and Jackiw copies of my longhand manuscript, and 
an interesting correspondence ensued. In a letter dated August 25, 1968, Jackiw was 
skeptical whether one could extract concrete physical predictions from the anomaly, 
and whether one could augment the divergence of the axial-vector current by a def- 
inite extra electromagnetic contribution, as in the modified PCAC equation above. 
Bell, who was traveling, wrote me on Sept. 2, 1968, and was more appreciative of 
the possibility of using a modified PCAC to get a formula for the neutral pion de- 
cay amplitude, writing "The general idea of adding some quadratic electromagnetic 
terms to PCAC has been in our minds since Sutherland's r] problem. We did not 
see what to do with it." He also defended the approach he and Jackiw had taken, 
writing "The reader may be left with the impression that your development is con- 
tradictory to ours, rather than complementary. Our first observation is that the a 
model interpreted in a conventional way just does not have PCAC. This is already 
a resolution of the puzzle, and the one which you develop in a very nice way. We, 
interested in the cr-model only as exemplifying PCAC, choose to modify the con- 
ventional procedures, in order to exhibit a model in which general PCAC reasoning 
could be illustrated in explicit calculation." In recognition of this letter from John 
Bell, whom I revered, I added a footnote 15 to my manuscript saying "Our results 
do not contradict those of Bell and Jackiw, but rather complement them. The main 
point of Bell and Jackiw is that the a model interpreted in the conventional way, 
does not satisfy the requirements of PCAC. Bell and Jackiw modify the a model in 
such a way as to restore PCAC. We, on the other hand, stay within the conventional 
a model, and try to systematize and exploit the PCAC breakdown." This footnote, 
which contradicts statements made in the text of my paper, has puzzled a number 
of people; in retrospect, rather than writing it as a paraphrase of Bell's words, I 
should have quoted directly from Bell's letter. 

Following this correspondence, my paper was typed on my return to Princeton 
and was received by the Physical Review on Sept. 24, 1968. (Bell and Jackiw's 
paper, a CERN preprint dated July 16, 1968, was submitted to II Nuovo Cimento, 
and received by that journal on Sept. 11, 1968.) My paper was accepted along with 
a signed referee's report from Bjorken, stating "This paper opens a topic similar 
to the old controversies on photon mass and nature of vacuum polarization. The 
lesson there, as I (no doubt foolishly) predict will happen here, is that infinities 
in diagrams are really troublesome, and that if the cutoff which is used violates a 
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cherished symmetry of the theory, the results do not respect the symmetry. I will 
also predict a long chain of papers devoted to the question the author has raised, 
culminating in a clever renormalizable cutoff which respects chiral symmetry and 
which, therefore, removes Adler's extra term." Thus, acceptance of the point of 
view that I had advocated was not immediate, but only followed over time. In 1999, 
Bjorken was a speaker at my 60th birthday conference at the Institute for Advanced 
Study, and amused the audience by reading from his report, and then very graciously 
gave me his file copy, with an appreciative inscription, as a souvenir. 

The viewpoint that the anomaly determined the ir^ — > 77 decay amplitude had 
significant physical consequences. In the Appendix to my paper, I showed that the 
value "S* = ^ implied by the fractionally charged quark model gave a decay amplitude 
that was roughly a factor of 3 too small. More generally, I showed that a triplet 
constituent model with charges {Q,Q — 1,Q — 1) gave S = Q — ^, and so with 
integrally charged constituents {Q = or Q = 1) one gets an amplitude that agrees 
in absolute value, to within the expected accuracy of PCAC, with experiment. I 
noted in my paper that Q = 0, or 5 = — | corresponded to the case in which 
radiative corrections to weak interactions had been shown to be finite, but this 
choice for the sign of the 77 amplitude was soon to be ruled out. Over the 

next few months Okubo (1969) and Oilman (1969) wrote me letters accompanying 
preprints which demonstrated, by different methods, that the sign corresponding 
to a single positive integrally charged constituent going around the triangle loop 
agrees with experiment. Okubo also analyzed various alternative models for proton 
constituents, and pointed out that while some are excluded by the experimentally 
determined value of S, the integrally charged Maki (1964)-IIara (1964) single triplet 
model (the model that I had considered in my Appendix, but now with Q = 1), and 
the corresponding integrally charged three triplet model of Han and Nambu (1965) 
(see also Tavkhelidze (1965), Miyamoto (1965), and Nambu (1965)), are both in 
accord with the empirical value 5 ~ ^. In a conference talk a year later, in September 
1969 (Adler, 1970a, R17) I reviewed the subject of the anomaly calculation of neutral 
pion decay, as developed in the papers that had appeared during the preceding year. 

The work just described gave the first indications that neutral pion decay pro- 
vides empirical evidence that can discriminate between different models for hadronic 
constituents. The correct interpretation of the fact that 5" ~ i came only later, when 
what we now call the "color" degree of freedom was introduced in the seminal pa- 
pers of Bardeen, Fritzsch, and Gell-Mann (1972; reprinted as hep-ph/0211388) and 
Fritzsch and Gell-Mann (1971/1972; reprinted as hep-ph/0301127). These papers 
used my calculation of — > 77 decay as supporting justification for the tripling 
of the number of fractionally charged quark degrees of freedom, thus increasing the 
theoretical value of S for fractionally charged quarks from ^ to ^. The paper of 
Bardeen, Fritzsch, and Gell-Mann also pointed out that this tripling would show up 
in a measurement of R, the ratio of hadronic to muon pair production in electron 
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positron collisions, while noting that "Experiments at present are too low in energy 
and not accurate enough to test this prediction, but in the next year or two the 
situation should change." , as indeed it did. 

Before leaving the subject of the early history of the anomaly and its antecedents, 
perhaps this is the appropriate place to mention the paper of Johnson and Low 
(1966), which showed that the Bjorken (1966)-Johnson-Low (1966) (BJL) method 
of identifying formal commutators with an infinite energy limit of Feynman diagrams 
gives, in significant cases, results that differ from the naive field-theoretic evaluation 
of these commutators. This method was later used by Jackiw and Johnson (1969) 
and by Boulware and myself (Adler and Boulware, 1969, R18) to show that the AVV 
axial anomaly can be reinterpreted in terms of anomalous commutators. This line 
of investigation, however, did not readily lend itself to a determination of anomaly 
effects beyond leading order. For example, I still have in my files an unpublished 
manuscript (circa 1966) attempting to use the BJL method to tackle a simpler prob- 
lem, that of proving that the Schwinger term in quantum electrodynamics (QED) is 
a c-number to all orders of perturbation theory. I believe that this result is true (and 
it may well have been proved by now using operator product expansion methods), 
but I was not able at that time to achieve sufficient control of the BJL limits of high 
order diagrams with general external legs to give a proof. (See also remarks on this 
in Chapter 4.) 



Anomaly Nonrenormalization 

We are now ready to address the issue of the determination of anomalies beyond 
leading order in perturbation theory. Before the neutral pion low energy theorem 
could be used as evidence for the charge structure of quarks, one needed to be 
sure that there were no perturbative corrections to the anomaly and the low energy 
theorem following from it. As I noted above, the fcrmion loop-wise argument that 
I used in my original treatment left me convinced that only the lowest order AVV 
diagram would contribute to the anomaly, but this was not a proof. This point of 
view was challenged in the article by Jackiw and Johnson (1969), received by the 
Physical Review on Nov. 25, 1968, who stated "Adler has given an argument to the 
end that there exist no higher-order effects. He introduced a cutoff, calculated the 
divergence, and then let the cutoff go to infinity. This is seen in the present context 
to be equivalent to the second method above. However, we believe that this method 
may not be reliable because of the dependence on the order of limits." And in their 
conclusion, they stated "In a definite model the nature of the modification (to the 
axial-vector current divergence equation) can be determined, but in general only 
to lowest order in interactions." This controversy with Jackiw and Johnson was the 
motivation for a more thorough analysis of the nonrenormalization issue undertaken 
by Bill Bardeen and myself in the fall and winter of 1968-1969 (Adler and Bardeen, 
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1969, R19) and was cited in the "Acknowledgments" section of om paper, where we 
thanked "R. Jackiw and K. Johnson for a stimulating controversy which led to the 
writing of this paper." 

The paper with Bardeen approached the problem of nonrenormalization by two 
different methods. We first gave a general constructive argument for nonrenormal- 
ization of the anomaly to all orders, in both quantum electrodynamics and in the 
(T-model in which PCAC is canonically realized, and we then backed this argu- 
ment up with an explicit calculation of the leading order radiative corrections to 
the anomaly, showing that they cancelled among the various contributing Feynman 
diagrams. The strategy of the general argtiment was to note that since the anomaly 
equations written above involve unrenormalized fields, masses, and coupling con- 
stants, these equations are well defined only in a cutoff field theory. Thus, for both 
electrodynamics and the cr-model, we constructed cutoff versions by introducing 
photon or cr-meson regulator fields with mass A. (This was simple for the case of 
electrodynamics, but more difficult, relying heavily on Bill Bardeen's prior experi- 
ence with meson field theories, in the case of the a-model.) In both cases, the cutoff 
prescription allows the usual renormalization program to be carried out, expressing 
the unrenormalized quantities in terms of renormalized ones and the cutoff A. In 
the cutoff theories, the fermion loop-wise argument I used in my original anomaly 
paper is still valid, because regulating boson propagators does not alter the chiral 
symmetry properties of the theory, and thus it is straightforward to prove the va- 
lidity of the anomaly equations involving unrenormalized quantities to all orders of 
perturbation theory. 

Taking the vacuum to two 7 matrix clement of the anomaly equations, and 
applying the Sutherland- Veltman theorem, which asserts the vanishing of the matrix 
element of d^^j^ at the special kinematic point = 0, Bardeen and I then got exact 
low energy theorems for the matrix elements (27|2imoJ^|0) (in electrodynamics) 
and (27|(/7rM^/\/2)</'7r|0) (in the fi-model) of the "naive" axial- vector divergence 
at this kinematic point, which were given by the negative of the corresponding 
matrix element of the anomaly term. However, since we could prove that these 
matrix elements are finite in the limit as the cutoff A approaches infinity, this in 
turn gave exact low energy theorems for the renormalized, physical matrix elements 
in both cases. One subtlety that entered into the all orders calculation was the 
role of photon rescattering diagrams connected to the anomaly term, but using 
gauge invariance arguments analogous to those involved in the Sutherland-Veltman 
theorem, we were able to show that these diagrams made a vanishing contribution to 
the low energy theorem at the special kinematic point q'^ = 0. Thus, my paper with 
Bardeen provided a rigorous underpinning for the use of the tt^ —>■ 77 low energy 
theorem to study the charge structure of quarks. 

In our explicit second order calculation, we calculated the leading order radiative 
corrections to this low energy theorem, arising from addition of a single virtual pho- 
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ton or virtual fi-meson to the lowest order diagram. We did this by two methods, 
one involving a direct calculation of the integrals, and the other (devised by Bill 
Bardeen) using a clever integration by parts argument to bypass the direct calcula- 
tion. Both methods gave the same answer: the sum of all the radiative corrections is 
zero, as expected from our general nonrenormalization argument. We also traced the 
contradictory results obtained in the paper of Jackiw and Johnson to the fact that 
these authors had studied an axial- vector current (such as V'TmTsV' the a- model) 
that is not made finite by the usual renormalizations in the absence of electromag- 
netism; as a consequence, the naive divergence of this current is not multiplicatively 
renormalizable. As we noted in our paper, "In other words, the axial-vector current 
considered by Jackiw and Johnson and its naive divergence are not well-defined ob- 
jects in the usual renormalized perturbation theory; hence the ambiguous results 
which these authors have obtained are not too surprising." Our result of a defi- 
nite, unrcnormalized low energy theorem, we noted, came about because "In each 
model wc have studied a particular axial-vector current: in spinor electrodynamics, 
the usual axial-vector current ... and in the a model the Polkinghorne (1958a, b) 
axial- vector current ... which, in the absence of electromagnetism, obeys the PCAC 
condition." It is these axial-vector currents that obey a simple anomaly equation to 
all orders in perturbation theory, and which give an exact, physically relevant low 
energy theorem for the naive axial-vector divergence. 

This paper with Bill Baxdeen should have ended the controversy over whether 
the anomaly was renormalized, but it didn't. Johnson pointed out in an unpublished 
report that since the anomaly is mass-independent, it should be possible to calculate 
it in masslcss electrodynamics, for which the naive divergence 2i7noj^ vanishes and 
the divergence of the axial-vector current directly gives the anomaly. Moreover, in 
massless electrodynamics there is no need for mass renormalization, and so if one 
chooses Landau gauge for the virtual photon propagator, the second order radiative 
correction calculation becomes entirely ultraviolet finite, with no renormalization 
counter terms needed. Such a second order calculation was reported by Sen (1970), 
a Johnson student, who claimed to find nonvanishing second order radiative correc- 
tions to the anomaly. However, the calculational scheme proposed by Johnson and 
used by Sen has the problem that, while ultraviolet finite, there are severe infrared 
divergences, which if not handled carefully can lead to spurious results. After a long 
and arduous calculation (Adler, Brown, Wong, and Young, 1971) my collaborators 
and I were able to show that the zero mass calculation, when properly done, also 
gives a vanishing second order radiative correction to the anomaly. This confirmed 
the result I had found with Bardeen, which had by then also been confirmed by dif- 
ferent methods in the mo 7^ theory in papers of Abers, Dicus, and Teplitz (1971) 
and Young, Wong, Gounaris, and Brown (1971). 

Even this was not the end of controversies over the nonrenormalization theo- 
rem, as discussed in detail in my review Adler (2004a) that focuses specifically on 
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anomaly nonrenormalization. Suffice it to say here that no objections raised have 
withstood careful analysis, and there is now a detailed understanding of anomaly 
nonrenormalization both by perturbative methods, and by non-perturbative meth- 
ods proceeding from the Callan-Symanzik equations. There is also a detailed under- 
standing of anomaly nonrenormalization in the context of supersymmetric theories, 
where initial apparent puzzles are now resolved. 



Point Splitting Calculations of the Anomaly 

At this point let me backtrack, and discuss the role of point-splitting methods 
in the study of the Abelian electrodynamics anomaly. In the present context, 
point-splitting was first used in the discussion given by Schwingcr (1951) of the 
pseudoscalar-pseudovector equivalence theorem, to be described in more detail 
shortly. Almost immediately following circulation of the seminal anomaly preprints 
in the fall of 1968, Hagen (1969, received Sept. 24, 1968, and a letter to me dated 
Oct. 16, 1968), Zumino (1969, and a letter to me dated Oct. 7, 1968), and Brandt 
(1969, received Dec. 17, 1968, and a letter to nic dated Oct. 16, 1968) all rcdcrived 
the anomaly formula by a point-splitting method. Independently, a point-splitting 
derivation of the anomaly was given by Jackiw and Johnson (1969, received 25 
November, 1968), who explicitly made the connection to Schwinger's earlier work 
(Johnson was a Schwinger student, and was well acquainted with Schwinger's body 
of work). The point of all of these calculations is that the anomaly can be derived 
by formal algebraic use of the equations of motion, provided one redefines the sin- 
gular product ijj{x)^n'y5'tp{x) appearing in the axial-vector current by the point-split 
expression 

PX 

lim V(ic')7M75exp[— ie / dx^Bxltpix) , 

and takes the limit x' — > x at the end of the calculation. 

Responding to these developments, I appended a "Note added in proof" to my 
anomaly paper, mentioning the four field-theoretic, point-splitting derivations that 
had subsequently been given, and adding "Jackiw and Johnson point out that the 
essential features of the field-theoretic derivation, in the case of external electro- 
magnetic fields, are contained in J. Schwinger, Phys. Rev. 82, 664 (1951)". What 
to me was an interesting irony emerged from learning of the connection between 
anomalies and the famous Schwinger (1951) paper on vacuum polarization. I had in 
fact read Section II and the Appendices of the 1951 paper, when Alfred Goldhaber 
and I, during our senior year at Harvard (1960-61), did a reading course on quantum 
electrodynamics with Paul Martin, which focused on papers in Schwinger's reprint 
volume (Schwinger, 1958). Paul had told us to read the parts of the Schwinger pa- 
per that were needed to calculate the VV vacuum polarization loop, but to skip the 
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rest as being too technical. Reading Section V of Schwinger's paper brought back 
to mind a brief, forgotten conversation I had had with Jack Steinberger, who was 
Director of the Varenna Summer School in 1967. Steinberger had told me that he 
had done a calculation on the pseudovector-pseudoscalar equivalence theorem for 
-y^^ but had gotten different answers in the two cases; also that Schwinger had 
claimed to reconcile the answers, but that he (Steinberger) couldn't make sense out 
of Schwinger's argument. Jack had urged me to look at it, which I never did until 
getting the Jackiw- Johnson preprint, but in retrospect everything fell into place, 
and the connection to Schwinger's work was apparent. 

This now brings me to the question, did Schwinger's paper constitute the dis- 
covery of the anomaly? Both Jackiw, in his paper with Johnson, and I were careful 
to note the connection between Schwinger's (1951) paper and the point-splitting 
derivations of the anomaly, once it was called to our attention. However, recently 
some of Schwinger's former students have gone further, arguing that Schwinger was 
the discoverer of the anomaly and that my paper and that of Bell and Jackiw were 
merely a "rediscovery" of a previously known result. I believe that this claim goes 
beyond the published record of what is in Schwinger's paper, as analyzed in detail in 
Sec. 2.3 and Appendix A of my review Adler (2004a). Stated briefly, Schwinger's cal- 
culation was devoted to making the pseudovcctor calculation give the same non-zero 
answer as the pseudoscalar one, and what Schwinger calls the redefined axial-vector 
divergence is in fact not the divergence of the gauge-invariant axial-vector current, 
but rather the axial- vector current divergence minus the anomaly. In other words, 
Schwinger's calculation effectively transposes the anomaly term to the left-hand 
side of the anomaly equation, so that what he evaluates is the effective Lagrangian 
arising from the left-hand side of the equation 

which then necessarily gives the same result as calculation of an effective Lagrangian 
from the right-hand side, which is pseudoscalar coupling. There is no gauge-invariant 
axial-vector current for which the combination on the left-hand side is the diver- 
gence, but as shown in Eqs. (58) and (59) of R16, there is a gauge-non-invariant 
axial-vector current which has this divergence. 

The use of a point-splitting method was of course important and fruitful, and 
in retrospect, the axial anomaly is hidden within Schwinger's calculation. But 
Schwinger never took the crucial step of observing that the axial-vector current 
matrix elements cannot, in a renormalizable quantum theory, be made to satisfy 
all of the expected classical symmetries. And more specifically, he never took the 
step of defining a gauge-invariant axial-vector current by point splitting, which has 
a well-defined anomaly term in its divergence, with the anomaly term completely 
accounting for the disagreement between the pseudoscalar and pseudovector calcu- 
lations of neutral pion decay. So I would say that although Schwinger took steps in 
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the right direction, particularly in noting the utility of point-splitting in defining the 
axial-vector current, his 1951 paper obscured the true physics and does not mark 
the discovery of the anomaly. This happened only much later, in 1968, and led to a 
flurry of activity by many people. My view is supported, I believe, by the fact that 

Schwinger's calculation seemed arcane, even to people (like Steinberger) with whom 
he had talked about it and to colleagues familiar with his work, and exerted no in- 
fluence on the field until after preprints on the seminal work of 1968 had appeared. 



The Non-Abelian Anomaly, Its Nonrenormalization and Geometric 
Interpretation 

Since in the chiral limit the AVV triangle is identical to an AAA triangle (as is 

easily seen by an argument involving anticommutation of a 75 around the loop), I 
knew already in unpublished notes dating from the late summer of 1968 that the 
AAA triangle would also have an anomaly; a similar observation was also made 
by Gerstein and Jackiw (1969). Prom fragmentary calculations begun in Aspen I 
suspected that higher loop diagrams might have anomalies as well, so after the 
nonrenormalization work with Bill Bardeen was finished I suggested to Bill that he 
work out the general anomaly for larger diagrams. (I was at that point involved 
in other calculations with Wu-Ki Tung, on the perturbative breakdown of scaling 
formulas such as the Callan-Gross relation, to be discussed shortly.) I showed Bill 
my notes, which turned out to be of little use, but which contained a very pertinent 
remark by Roger Dashcn that including charge structure (which I had not) would 
allow a larger class of potentially anomalous diagrams. Within a few weeks. Bill 
carried out a brilliant calculation, by point-splitting methods, of the general anomaly 
in both the Abelian and the non-Abelian cases (Bardeen, 1969). Expressed in terms 
of vector and axial-vector Yang-Mills field strengths 

Ff{x) = d^Vix) - d'^Vix) - i[V''{x),V''{x)] - i[A^'{x),A''{x)] , 
F7(x) = d^'A'^ix) - ^''A^'{x) - i[V''{x),A''{x)] - i[A''(x),V''{x)] , 

Bardeen's result takes the form 

d'^J^^{x) = normal divergence term 

+ (l/47r2)e^,,,trj[AS[(l/4)Fr(x)Fr(^) + {m)FT {^)Fa i^) 
+ {2/?,)iAi'{x)A''{x)F^''{x) + {2/2)iF!^''{x)A''{x)A^{x) 

+ {2/2>)iA''{x)F^%x)A^{x) - (8/3)A'^(x)A'^(a;)A^(x)A^(x)] , 

with tr/ denoting a trace over internal degrees of freedom, and the internal 
symmetry matrix associated with the axial-vector external field. In the Abelian case, 
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with trivial internal symmetry structure, the terms involving two or three factors of 
vanish by antisymmetry of e^naT, and there are only AVV and AAA triangle 
anomalies. When there is non-trivial internal symmetry or charge structure, there 
are anomalies associated with the box and pentagon diagrams as well, confirming 
Dashen's intuition mentioned earlier. Bardeen notes that whereas the triangle and 
box anomalies result from linear divergences associated with these diagrams, the 
pentagon anomalies arise not from linear divergences, but rather from the definition 
of the box diagrams to have the correct vector current Ward identities. Bardeen also 
notes, in his conclusion, another prophetic remark of Dashcn, to the effect that the 
pentagon anomalies should add anomalous terms to the PCAC low energy theorems 
for five pion scattering; I shall return to this shortly. 

There are two distinct lines of argument leading to the conclusion that the non- 
Abelian chiral anomaly also has a nonrenormalization theorem, and is given exactly 
by Bardeen's leading order calculation. The first route parallels that used in the 
Abelian case, involving variously a loop-wise regulator construction, explicit fourth 
order calculation, and an argument using the Callan-Symanzik equations; for de- 
tailed references, see Adler (2004a). The conclusion in all cases is that the Adler- 
Bardeen theorem extends to the non- Abelian case. Heuristically, what is happening 
is that except for a few small onc-fcrmion loop diagrams, non-Abelian theories, just 
like Abelian ones, are made finite by gauge invariant regularization of the gluon 
propagators. But this regularization has no effect on the chiral properties of the 
theory, and therefore does not change its anomaly structure, which can thus be de- 
duced from the structure of the few small fermion loop diagrams for which naive 
classical manipulations break down. 

The second route leading to the conclusion that the non-Abelian anomaly is non- 
renormalized might be termed "algebraic/geometrical", and consists of two steps. 
The first step consists of a demonstration that the higher order terms in Bardeen's 
non-Abelian formula are completely determined by the leading, Abelian anomaly. 
During a summer visit to Fermilab in 1971, I collaborated with Ben Lee, Sam 
Treiman, and Tony Zee (Adler, Lee, Trciman, and Zee 1971, R20) in a calcula- 
tion of a low energy theorem for the reaction 7-I-7— ^■Tr-I-Tr-I-Trin both the neutral 
and charged pion cases. This was motivated in part by discrepancies in calculations 
that had just appeared in the literature, and in part by its relevance to theoretical 
unitarity calculations of a lower bound on the —>■ ji"^ ii~ decay rate. Using PCAC, 
we showed that the fact that the 7 -I- 7 — Stt matrix elements vanish in the limit 
when a final vr'^ becomes soft, together with photon gauge invariance, relates these 
amplitudes to the matrix elements F'" for 7 -|- 7 tt^ and F'^'^ for 7 ^ vr'^ -|- tt"*" -|- tt" , 
and moreover, gives a relation between the latter two matrix elements. 



eF- 




V2 
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Thus all of the matrix elements in question are uniquely determined by F'^, which 
itself is determined by the AVV anomaly calculation. An identical result for the same 
reactions was independently given by Terent'ev (Terentiev) (1971). In the meantime, 
in a beautiful formal analysis, Wess and Zumino (1971) showed that the current 
algebra satisfied by the flavor SU{3) octet of vector and axial-vector currents implies 
integrability or "consistency" conditions on the non-Abelian axial-vector anomaly, 
which are satisfied by the Bardeen formula, and conversely, that these constraints 
uniquely imply the Bardeen structure up to an overall factor, which is determined 
by the Abelian AVV anomaly. By introducing an auxiliary pseudoscalar field, Wess 
and Zumino were able to write down a local action obeying the anomalous Ward 
identities and the consistency conditions. (There is no corresponding local action 
involving just the vector and axial-vector currents, since if there were, the anomalies 
could be eliminated by a local counterterm.) Wess and Zumino also gave expressions 
for the processes 7 — > Stt and 27 — > Stt discussed by Adler et al. and Terentiev, as 
well as giving the anomaly contribution to the five pseudoscalar vertex. The net 
result of these three simultaneous pieces of work was to show that the Bardeen 
formula has a rigidly constrained structure, up to an overall factor given by the 
TT^ 77 decay amplitude. 

The second step in the "algebraic/geometric" route to anomaly renormaliza- 
tion is a celebrated paper of Witten (1983), which shows that the Wess-Zumino 
action has a representation as the integral of a fifth rank antisymmetric tensor (con- 
structed from the auxiliary pseudoscalar field) over a five-dimensional disk of which 
four- dimensional space is the boundary. In addition to giving a new interpretation of 
the Wess Zumino action T, Wittcn's argument also gave a constraint on the overall 
factor in F that was not determined by the Wess-Zumino consistency argument. 
Witten observed that his construction is not unique, because a closed five-sphere 
intersecting a hyperplane gives two ways of bounding the four-sphere along the equa- 
tor with a five dimensional hemispherical disk. Requiring these two constructions 
to give the same value for exp(ir), which is the way the anomaly enters into a 
Feynman path integral, requires integer quantization of the overall coefficient in the 
Wess-Zumino-Witten action. This integer can be read off from the AVV triangle 
diagram, and for the case of an underlying color SU {Nc) gauge theory turns out to 
be just Nc, the number of colors. 

To summarize, the "algebraic/geometric" approach shows that the Bardeen 
anomaly has a unique structure, up to an overall constant, and moreover that this 
overall constant is constrained by an integer quantization condition. Hence once the 
overall constant is fixed by comparison with leading order perturbation theory (say 
in QCD), it is clear that this result must be exact to all orders, since the presence 
of renormalizations in higher orders of the strong coupling constant would lead to 
violations of the quantization condition. 

The fact that non-Abelian anomalies are given by an overall rigid structure 
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has important implications for quantum field theory. For example, the presence of 
anomalies spoils the renormalizability of non-Abelian gauge theories and requires 
the cancellation of gauged anomalies between different fermion species (see Gross 
and Jackiw (1972), Bouchiat, Iliopoulos, and Meyer (1972), and Weinberg (1973)), 
through imposition of the condition tr{TQ,T^}T^ = for all a,/3, 7, with the 
coupling matrices of gauge bosons to left-handed fcrmions. The fact that anomalies 
have a rigid structure then implies that once these anomaly cancellation conditions 
are imposed for the lowest order anomalous triangle diagrams, no further conditions 
arise from anomalous square or pentagon diagrams, or from radiative corrections 
to these leading fermion loop diagrams. Other places where the one-loop geometric 
structure of non-Abelian anomalies enters are in instanton physics, and in the 't 
Hooft anomaly matching conditions. These and other chiral anomaly applications 
are discussed in more detail in my review Adler (2004a), and also in my Encyclopedia 
of Mathematical Physics article Adler (2004b). Both of these sources give extensive 
references to recent review articles and books on anomalies, which update the 1970 
reviews given in my Brandeis lectures (Adler, 1970b) and in Jackiw's Brookhaven 
lectures (Jackiw, 1970). 



Perturbative Corrections to Scaling 

While finishing the paper with Bardeen on anomaly nonrenormalization, I had cm- 
barked on a different set of perturbative calculations with Wu-Ki Tung; these be- 
came a forerunner of a different kind of "anomaly" , the anomalous scaling observed 
in deep inelastic electron and neutrino scattering. Our starting point was the ques- 
tion of whether applications of the Bjorken (1966) limit technique, which assumed 
that the asymptotic behavior of time-ordered products is given by the "naive" or 
free field theory equal time commutator, would be modified in perturbation theory. 
Strong hints in this direction had been given in a paper of Johnson and Low (1966), 
which showed that the "Bjorken- Johnson-Low" limit can produce anomalous com- 
mutators, and related results were also obtained in an earlier paper of Vainshtein 
and loffe (1967); our aim was to do calculations focusing on several physically impor- 
tant applications not covered in this previous work. These were the calculation by 
Bjorken (1966) of the radiative corrections to /3-dccay, the Bjorken (1967) backward- 
neutrino-scattering asymptotic sum rule, and the Callan-Gross (1969) relation re- 
lating the ratio of the longitudinal to transverse deep inelastic electron scattering 
cross sections to the constitution of the electric current, with the latter an applica- 
tion both of the Bjorkcn-Johnson-Low limit method, and of the later proposal by 
Bjorken (1969) of scaling of the deep inelastic structure functions. 

For our test model, we considered an SU{3) triplet of spin-1/2 particles bound 
by exchange of a massive singlet gluon, which we took as either a vector, scalar, 
or pseudoscalar. The results of the vector exchange calculation, to leading order of 
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perturbation theory, were reported in Adlcr and Tung (1969), R21, while additional 
leading order results in the scalar and pseudoscalar gluon cases, and some fourth 
order results, were given in the follow-up paper Adler and Tung (1970), R22. We 
concluded that the Callan-Gross relation for spin-1/2 quarks, which asserts the van- 
ishing of (f'(JL{(f', ui) for large (f' with fixed scaling variable w, breaks down in leading 
order of perturbation theory. A similar conclusion was also reached by Jackiw and 
Preparata (1969a,b), whose first paper appears in the same issue of Physical Review 
Letters as our paper R21. Tung and I related the breakdown of the Callan-Gross 
relation to a corresponding breakdown of Bjorken's backward neutrino sum rule. 
We also showed that the certain current commutators receive a systematic pattern 
of logarithmic asymptotic corrections, and calculated the leading perturbative cor- 
rection to the logarithmically divergent part of the radiative corrections to (3 decay. 
Tung (1969), while still at the Institute, and Jackiw and Preparata (1969c), went 
on to carry out general analyses of the range of validity and breakdown of the 
Bjorken-Johnson-Low limit in perturbation theory. 

These papers had a number of implications for subsequent developments. The 
logarithmic deviations from the Callan-Gross relation were soon understood in a 
more systematic way through the Wilson (1969) operator product expansion and 
the Callan (1970)-Symanzik (1970) equations, which gave anomalous dimensions in 
accord with the leading order results obtained by Tung and me and by Jackiw and 
Preparata, and with the fourth order results obtained by Tung and me in R22; for a 
discussion of this, see Beg (1975). The fact that perturbative field theory gives strong 
violations of scaling led to a skepticism as to whether field theory could describe 
the strong interactions at all. For example, Fritzsch and Gell-Mann (1971/1972), in 
their long paper on "Light Cone Current Algebra" , remarked that "The renormalized 
perturbation theory, taken term by term, reveals various pathologies in commutators 
of currents. Not only are there in each order logarithmic singularities on the light 
cone, which destroy scaling, and violations of the rule that (Jl/^^t ^ in the Bjorken 
limit, but also a careful perturbation theory treatment show the existence of higher 
singularities on the light cone..." This was one of their motivations for introducing 
the light cone algebra, which abstracted from field theory algebraic relations that 
led to scaling and parton model results, with the field theory itself being discarded. 

At the same time, there were also thoughts that a renormalization group fixed 
point in field theory might provide a remedy. In the same article, Fritzsch and 
Gell-Mann noted that in the context of a singlet vector gluon theory, "we must 
imagine that the sum of perturbation theory yields the special case of a 'finite 
vector theory'^"^ [reference to Gell-Mann and Low, and Baker and Johnson] if we 
are to bring the vector gluon theory and the basic algebra into harmony." Quite 
independently, in a conference talk at Princeton that I gave in October of 1971 
(published considerably later as Adler (1974), R23), in Section 2.4, on "Questions 
raised by the breakdown of the BJL limit", I made the remark "Can one make a 
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consistent calculational scheme in which Bjorken Hmits, the Callan-Gross relation 
and scaling are all valid? This is a real challenge to theorists. ..Perhaps a successful 
approach would involve summation of perturbation theory graphs plus use of the 
Gell-Mann-Low eigenvalue condition (see sect. 3)." (l made these comments at 
just the time when I was working on a possible eigenvalue condition in quantum 
electrodynamics, growing out of the work of Gell-Mann and Low, and Johnson, 
Baker, and Willey, as described below in Chapter 4. The relevance of an eigenvalue 
to power law behavior was also pointed out in the papers of Callan (1972) and of 
Christ, Hasslacher, and Mueller (1972), which I included as references when I edited 
my 1971 conference talk in the fall of 1972.) However, in the field theories then 
under consideration, there was an obstacle to realizing this idea. As I noted in Sec. 
3 of my Princeton talk, for singlet gluon theories the renormalization group methods 
suggested either no simple scaling behavior (if there were no renormalization group 
fixed point at which the P function had a zero) , or power law deviations from scaling 
of the form {q'^)~^ (if there were a fixed point at a nonzero coupling value Aq where 
P vanished, with 7 the value of the anomalous dimension at the fixed point). Since 
in a strong coupling theory 7 would be expected to be large at the fixed point, power 
law deviations from scaling looked to be too large to agree with experiment. 

It took another eighteen months for this obstacle to be overcome. Three de- 
velopments were involved: the introduction of the modern form of "color" as a 
tripling of the fractionally charged quark degrees of freedom by Bardeen, Pritzsch, 
and Gell-Mann (1972), the non-Abelian gauging of this form of color by Pritzsch 
and Gell-Mann (1972), and finally, in line with Cell-Mann's dictum "Nature reads 
the books of free field theory", a search for field theories that would have almost 
free behavior in the scaling limit. The conclusion of this search, the discovery of the 
asymptotic freedom of non-Abelian gauge theories and its implications by Gross, 
Politzer, and Wilczek, in the end proved a realization of the field-theoretic route 
that been contemplated by various people in 1971. In asymptotically free theories, 
because the renormalization group fixed point (the Gell-Mann-Low eigenvalue) is 
at zero coupling, where the anomalous dimension 7 vanishes, the deviations from 
scaling are not powers of q^, but rather only powers of logq'^, with exponents that 
can be calculated in leading order of perturbation theory. Thus the deviations from 
scaling predicted by non-Abelian gauge theories, and specifically by quantum chro- 
modynamics (QCD) as the theory of the strong interactions, are much weaker than 
would be expected for singlet gluon theories, and are compatible with experiment. 

Returning briefly to the calculations that Tung and I did, our results for the 
radiative corrections to /3-decay in the singlet vector gluon model turned out later 
to have applications in the QCD context. They can be converted to the realistic case 
of the octet gluon of QCD by multiplication by a color factor, as discussed in the 
review of Sirlin (1978), and so have become part of the technology for calculating 
radiative corrections to weak processes. 
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Trace Anomalies to All Orders 

In an influential paper Wilson (1969) proposed the operator product expansion, 
incorporating ideas on the approximate scale invariance of the strong interactions 
suggested by Mack (1968). As one of the applications of his technique, Wilson dis- 
cussed 7r° — 27 decay and the axial-vector anomaly from the viewpoint of the short 
distance singularity of the coordinate space AW three-point function. Using these 
methods, Crewther (1972) and Chanowitz and Ellis (1972) investigated the short 
distance structure of the three-point function OV^Vi,, with 6 = 6'^ the trace of the 
energy-momentum tensor, and concluded that this is also anomalous, thus confirm- 
ing earlier indications of a perturbative trace anomaly obtained in a study of broken 
scale invariance by Coleman and Jackiw (1971). Letting A^jy(p) be the momentum 
space OV^Vu three point function, and 11^;^ be the corresponding V^jVu two-point 
function, the naive Ward identity Af^^{p) = (2 — Pfjd / dpcr)^^v{p) is modified to 



Thus, for QED, with a single fermion of charge e, the anomaly term is 
— [2a/(37r)](pjuPj,— Ty^jyP^). In a subsequent paper, Chanowitz and Ellis (1973) showed 
that the fourth order trace anomaly can be read off directly from the coefficient of 
the leading logarithm in the asymptotic behavior of Il^jy (p) , giving to next order an 
anomaly coefficient — 2Q;/(37r) — c? j(2'n'^). Thus, their fourth order argument indi- 
cated a direct connection between the trace anomaly and the renormalization group 
/? function. 

My involvement with trace anomalies began roughly five years later, when Phys- 
ical Review sent me for refereeing a paper by Iwasaki (1977). In this paper, which 
noted the relevance to trace anomalies, Iwasaki proved a kinematic theorem on the 
vacuum to two photon matrix element of the trace of the energy-momentum ten- 
sor, that is an analog of the Sutherland-Veltman theorem for the vacuum to two 
photon matrix element of the divergence of the axial- vector current. Just as the 
latter has a kinematic zero at = 0, Iwasaki showed that the kinematic struc- 
ture of the vacuum to two photon matrix element of the energy-momentum tensor 
implies, when one takes the trace, that there is also a kinematic zero at = 0, 
irrespective of the presence of anomalies (just as the Sutherland-Veltman result 
holds in the presence of anomalies). Reading this article suggested the idea that just 
as the Sutherland-Veltman theorem can be used as part of an argument to prove 
nonrenormalization of the axial-vector anomaly, Iwasaki's theorem could be used 
to analogously calculate the trace anomaly to all orders. (In addition to writing a 




with the trace anomaly coefficient R given by 




February 2, 2008 2;58 Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics archive4 



52 Adventures in Theoretical Physics 

favorable report on Iwasaki's paper, I invited him to spend a year at the IAS, which 
he did during the 1977-78 academic year.) During the spring of 1976 I wrote an 
initial preprint attempting an all orders calculation of the trace anomaly in quan- 
tum electrodynamics, but this had an error pointed out to me by Baqi Beg. Over 
the summer of 1976 I then collaborated with two local postdocs, John Collins (at 
Princeton) and Anthony Duncan (at the Institute), to work out a corrected ver- 
sion (Adler, Collins, and Duncan, 1977, R24). Collins and Duncan simultaneously 
teamed up with another Institute postdoc, Satish Joglekar, to apply similar ideas 
to quantum chromodynamics, published as Collins, Duncan, and Joglekar (1977), 
and independently the same result for QCD was obtained by N. K. Nielsen (1977). 
Similar results were given in a preprint of Minkowski (1976), which grew out of 
discussions in the Gell-Mann group at Cal Tech in which the role of the (3 function 
in the trace anomaly formula, and its implications for generating the scale of the 
strong interactions, were appreciated (C. T. Hill, private communication, 2005, and 
P. Minkowski, private communication, 2005). 

In the simpler case of QED, the argument based on Iwasaki's theorem is given 
in Section II of R24. The basic idea is to use Iwasaki's result for the vacuum to 
two photon matrix clement of the trace of the energy momentum tensor, together 
with expressions for the electron to electron and the vacuum to two photon matrix 
elements of the "naive" trace niQip^ given by application of the Callan-Symanzik 
equations. The final result for the trace is given by 

e^ = [l + 5(a)]moV^V' + \p{a)N[F^,F^''] + ... , 

with N[ ] an explicitly defined subtracted operator, with ... indicating terms that 
vanish by the equations of motion, and with 6 (a) and (3(a) the rcnormalization 
group functions defined hy 1 + 6 (a) = [m / mQ)dmQ / dm and /3(a) = {m/ a)da/ dm. 
The first two terms in the power series expansion of the coefficient of the Fx^F^"^ 
term in the trace agree with the fourth-order calculation of Chanowitz and Ellis. The 
trace equation in QCD has a similar structure, again with the f3 function appearing 
as the anomaly coefficient. The fact that the trace anomaly coefficient is given by the 
appropriate /3 function extends to the super symmetric case, and leads to interesting 
issues that are reviewed in the final section of Adler (2004a). The appearance of 
the /3 function in the anomaly coefficient has also played a role in the inference of 
the structure of effective Lagrangians from the form of the trace anomaly; see, for 
example, Pagels and Tomboulis (1978) for an application to QCD, and Veneziano 
and Yankielowicz (1982) for an application to super symmetric Yang-Mills theory. 
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4. Quantum Electrodynamics 



Introduction 

My interest in a detailed study of quantum electrodynamics (QED) began during 
my visit to Cambridge, U.K. in tiie spring of 1968, when I found the anomalous 
properties of the axial-vector triangle diagram discussed in Chapter 3. This started 
me thinking more generally about the properties of fermion loop diagrams, and 
in particular I wondered whether such diagrams in quantum electrodynamics could 
lead to an eigenvalue condition for the electric charge, possibly giving an explanation 
of why the charges of different particle species (such as the electron and proton) are 
the same in magnitude. This speculation ultimately proved to be wrong, and I look 
back on the investigations that it inspired with mixed feelings, as being somewhat of 
a misadventure. On the one hand, my work on aspects of quantum electrodynamics 
led to a number of important papers with useful results, but on the other hand, my 
preoccupation with this program kept me from jumping into the emerging area of 
Yang— Mills unification at the point when much of the interesting theoretical work 
on non-Abclian theories was being done. 

My work on QED divided into three distinct phases, described in the following 
sections. The first part dealt with a calculation of the process of photon splitting 
in strong magnetic fields, which served as a warm-up for getting into the study 
of fermion loop diagrams. After this calculation was completed, I turned to an 
investigation of the renormalization group properties of QED, using as a tool the 
newly discovered Callan-Symanzik equations. Finally, in an attempt to get a better 
formalism for calculating the renormalization group (3 function contribution from 
closed loop diagrams, I worked out a compactification of masslcss QED on the 4- 
sphere, and applied this formalism to a number of theoretical issues. By the end 
of this phase, it was clear that developments in non-Abelian gauge theories were 
the future of the field of particle physics and, through grand unification, offered 
a compelling way to understand charge quantization, which had been the starting 
motivation for my interest in electrodynamics. So at this point I set my QED work 
aside and moved on to some of the phenomenological investigations described in 
Chapter 5. 
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Strong Magnetic Field Electrodynamics: 

Photon Splitting and Vacuum Dielectric Constant 

The discovery of pulsars with ultra-strong trapped magnetic fields led to a surge of 
interest in strong field QED processes, that are unobservably small for attainable 
laboratory magnetic fields. One of the processes of interest is photon splitting in 
a constant magnetic field, which is described by a closed electron loop Feynman 
diagram. When conversations at the Institute turned to whether this reaction could 
be of relevance in the dynamics of pulsar magnetospheres, my interest in getting 
into a general study of fermion loop processes in QED made it natural for me to 
get involved. The initial phase of this study led to a paper (Adler, Bahcall, Callan, 
and Rosenbluth, 1970, R25), that surveyed the basic features of the photon splitting 
process. Briefly, the lowest order box diagram makes a vanishing contribution, by 
an argument using Lorentz invariance and gauge invariance, and so the leading 
contribution comes from the hexagon diagram, with three insertions of the external 
magnetic field. (Earlier calculations had overlooked this fact, and so led to the 
wrong dependence on magnetic field strength.) Using the Heisenberg-Euler effective 
Lagrangian, we calculated the photon splitting absorption coefficients for the various 
photon polarization states relative to the magnetic field vector, to leading order 
in the external magnetic field, for photon energies small relative to the electron 
mass. We also gave the selection rules that result from the fact that the dielectric 
constant for the vacuum permeated by a strong magnetic field is different for the 
different photon polarizations (this was where Marshall Rosenbluth's expertise as a 
plasma physicist entered in), and made numerical estimates. Some of our results were 
independently obtained around the same time by Bialynicka-Birula and Bialynicki- 
Birula (1970). 

Again with the aim of getting more experience with QED calculations, I decided 
to embark on an exact calculation of photon splitting, for arbitrary magnetic fields 
and for arbitrary photon energies below the pair production threshold. This involved 
a very lengthy calculation using the proper time method, that Schwinger had first 
used (Schwinger, 1951) to give an elegant rederivation of the Heisenberg-Euler effec- 
tive Lagrangian. I derived general formulas for both the photon splitting amplitudes, 
and the refractive indices needed for the selection rules (in the latter case correcting 
an earlier result of Minguzzi (1956,1958a,1958b)) . I wrote a computer program to 
numerically evaluate the photon splitting absorption rates, and computed sample 
results, as well as giving a detailed discussion of possible plasma physics corrections 
to the selection rules. These results were all reported in a comprehensive article 
(Adler, 1971, R26) on photon splitting and dispersion in a strong magnetic field. 

My overall conclusion was that the leading order calculation from the hexagon 
diagram gives good order of magnitude estimates, as graphed in Fig. 8 of R26, which 
plots the ratio of the exact photon splitting absorption coefficient to the hexagon 
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diagram prediction, versus magnetic field, for photon frequencies equal to zero and 
equal to the electron mass m. This plot, incidentally, gives a check both on my 
exact analytic calculation and the numerical work, since the ratio approaches unity 
for small field strengths, where the hexagon dominates. For magnetic fields of order 
the "critical field" Bcr = m^/e ~ 4.41 x 10^ Tcsla (4.41 x 10^''^ Gauss), and photon 
frequencies of order the electron mass m, the photon splitting mean free path is 
much shorter than characteristic pulsar magnetosphere depths. However, since the 
absorption coefficients scale as for small fields, and since the pulsars known 
in 1971 tended to have fields of up to a few tenths of Bcr, the photon splitting 
process at that time seemed to be not of great astrophysical importance. Stoneham 
(1979) published an analytic recalculation of photon splitting by a different method 
(without numerical evaluation), which as we shall see agreed with my calculation. 
In an Appendix to his paper, he also improved on my estimate of the very small 
corrections that arise from the box diagram, when finite opening angles resulting 
from photon dispersion are taken into account, and we exchanged letters on this 
aspect of his work. However, after Stoneham's paper, interest in photon splitting 
waned for quite a number of years. 

In the mid 1990's, the discovery of "magnetars", pulsars with fields much higher 
than the critical field, revived interest in photon splitting. Around April, 1995, John 
Bahcall told me that recent papers by Mentzel, Berg, and Wunncr (1994) and Wun- 
ner. Sang, and Berg (1995) claimed that the photon splitting absorption coefficients 
for energetic photons in strong fields were a factor of 10^ higher than given in my 
1971 paper. If true, this would have had important astrophysical ramifications, so I 
looked back at my own work, and at the papers of the Wunncr group. I was struck 
by the fact that the Wunner group had not checked to see whether their calculation 
reproduced the known B^ dependence of photon splitting for weak fields and low 
energy photons, a consistency test that, as noted above, I had incorporated into 
my analytic and numerical work. So I strongly suspected that they had made an 
error, possibly through a lack of gauge invariance, and wrote a letter to this effect 
to the Wunner group, while John simultaneously wrote to Astrophysical Journal 
Letters, where their second paper was being considered for publication. Neither of 
these letters had any effect, and the Wunner, Sang, and Berg paper was published 
in December, 1995. John Bahcall and Bohdan Paczynski then urged me to make 
my private misgivings known more publicly. In response, I wrote a short IAS As- 
trophysics Preprint Series article in January, 1996 (Adler, 1996), expanding on my 
letter to the Wunner group, and concluding "it is important that their calculation 
and mine be rechecked by a third party, with the aim of understanding where the 
discrepancy arises and determining who is right." I submitted this note to the As- 
trophysical Journal, which rejected it. 

Although this short note was never published, it had the intended effect as 
a result of its internal circulation within the IAS. Not long afterwards Christian 
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Schubert, an IAS visitor at the time, came to my office and said that with new 
"stringy" Feynman rules with which he was expert, he thought he could repeat in 
a few days the calculation that had taken me a couple of months by the proper 
time method. I replied that if he could do that, I would deal with the numerical 
aspects. A week or two later Christian gave me two equivalent formulas for the 
photon splitting amplitude obtained by his methods; in the meantime, the Russian 
group of Baier, Milstein, and Shaisultanov (1996) had produced yet another calcu- 
lation, which agreed numerically with my 1971 paper. During a short visit to the 
Institute for Theoretical Physics in Santa Barbara, I wrote programs to directly 
compare Schubert's two expressions, my 1971 result, Stoneham's 1979 formula, and 
the analytic formula of the Russian group, all as applied to the allowed polarization 
case. (The reason for doing this numerically is that an analytic conversion between 
inequivalent Feynman parameterizations is very difficult, because zero can be writ- 
ten as a multidimensional integral in complicated ways.) The programs showed that 
the five calculations gave precisely identical amplitudes. This was reported in the 
paper that I drafted with Schubert on my return to the IAS (Adler and Schubert, 
1996, R27). We also posted my computer programs on my web site, and advertised 
this posting in the paper, so that the community at large could verify what wc had 
done. About a month later, I received an email from Wunner retracting the earlier 
numerical results of his group, which turned out to result from a single sign error in 
their computer programs. When this sign error was corrected, the analytic results of 
Mentzel, Berg and Wunner gave answers that agreed with everyone else, as discussed 
in Wilkc and Wunner (1997). Thus the photon splitting controversy was finally re- 
solved. Stibscqucntly, John Bahcall had me assemble a file of all the relevant papers 
and correspondence for a post-mortem meeting that he held with the editors of the 
Astrophysical Journal, to analyze and improve the process that that had allowed an 
incorrect paper to get into print, despite several advance warnings that the results 
were suspect. 



The "Finite QED" Program via the Callan— Symanzik Equations 

My comprehensive article on photon splitting was finished in early 1971, and the 
following summer I returned to my long-standing interest in a study of unresolved 
issues in the theory of quantum electrodynamics. Johnson, Baker, and Willey (1964), 
Johnson, Willey, and Baker (1967), and Baker and Johnson (1969, 1971a,b) had 
written an important series of papers (referred to below as JBW) in which they 
argued that if QED has a Gell-Mann-Low eigenvalue, then the asymptotic behavior 
of both the electron and photon propagators would drastically simplify, with the 
mass term in the electron propagator having power law scaling behavior, and the 
asymptotic photon propagator behaving, after charge renormalization, as if it had 
no photon self-energy part. Bill Bardeen and I were both in Aspen for part of the 
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summer of 1971, and we embarked on a study of QED using the then very new CaUan 
(1970)-Symanzik (1970) equations. Rather than addressing the issue of a possible 
eigenvalue in QED, we studied the simplified model suggested by the presence of 
such an eigenvalue, in which the photon propagator is taken as a free propagator 
with no self-energy part. In this case the /3 function term, which has a coupling 
constant derivative, is not present in the Callan-Symanzik equations, and these 
equations then can be explicitly integrated to give the simple form for the electron 
propagator found by JBW. These results were described in the paper Adler and 
Bardeen (1971), R28. In addition to giving results of interest for QED, this paper 
was one of the first applications of the Callan-Symanzik equations, and was also 
a motivation for my remarks at the Princeton conference later in 1971 (see R23), 
in which I suggested a possible connection between an eigenvalue condition in the 
strong interactions and Bjorken scaling. 

After finishing the paper with Bardeen, I turned to a detailed study of the full 
theory of QED, with photon self-energy parts retained, on which I wrote a com- 
prehensive paper Adler (1972a), R29. This paper had a number of new results. I 
began with a review of the original Gell-Mann-Low formulation of the renormal- 
ization group in QED, and then redid their analysis in terms of the more modern 
Callan-Symanzik approach, ending up in Eq. (53) with the explicit map between 
the Callan-Symanzik P{a) function and the functions 'ip{a) and q{a) that enter into 
the Gell-Mann-Low formulation. (An implicit form of this map had appeared in 
Sec. II. 3 of Symanzik (1970).) After reviewing the JBW program and the results 
obtained with Bardeen in R28, I showed by an argument based on the Federbush- 
Johnson (1960) theorem that i/ there is an eigenvalue in QED. then in the masslcss 
limit all 2n-point current correlation functions must vanish at the eigenvalue. I then 
went on to show, in an argument that benefited from a conversation with Roger 
Dashen, that the vanishing of higher correlation functions also implied the vanish- 
ing of all coupling constant derivatives of the photon proper self-energy part at the 
eigenvalue; hence the eigenvalue, if it existed, must be an infinite order zero of the 
one-loop P function. These were all correct results that give the paper an enduring 
value. 

I concluded the paper by proposing that in addition to the standard renormal- 
ization group result, in which the eigenvalue plays the role (through running of 
the coupling) of the unrenormalized fine structure constant ao, there could be an 
additional solution, resulting from a fermion-loopwise summation of the theory, in 
which the eigenvalue plays the role of the physical coupling a. A motivation for this 
proposal was that the formal power series argument, which shows the equivalence of 
loopwise summation to the usual renormalization group analysis, could break down 
in the presence of an essential singularity in the coupling. I then went on to conjec- 
ture that loopwise summation with an eigenvalue for a was the mechanism fixing 
the physical fine structure constant in a uniform manner for all fermion species. 
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As I have noted in the Introduction to this Chapter, this conjecture turned out to 
be wrong, and in retrospect my excessive emphasis on it in writing R29 distorted 
the presentation of an otherwise good paper. At the time key people working on 
the renormalization group, in particular Gell-Mann, Low, and Wilson, were all very 
skeptical. Wilson, in particular, remarked at a Princeton seminar that my demon- 
stration of an infinite order zero showed there could be no eigenvalue in QED, and 
although I was privately annoyed at the time, it is now clear that this was the correct 
conclusion. 

Finally, in an Appendix to my paper, I returned to the electron propagator 
analysis carried out in R28, this time in a general covariant gauge. This investigation 
was later reanalyzed in more detail, and improved, in a comprehensive study by 
Lautrup (1976). 

The final paper in this section, Adler, Callan, Gross, and Jackiw (1972), R30, 
studied the combined implications of the BJL limit, the nonrenormalization of 
anomalies, and the possible presence of an eigenvalue in QED. This paper, which 
was initially drafted by Roman Jackiw, grew out of discussions among the authors 
at Princeton and at the National Accelerator Laboratory. It shows that the following 
three phenomena are, when taken in combination, incompatible: (1) nonrenormal- 
ization of the axial-vector anomaly, (2) the existence of an eigenvalue in QED, (3) 
validity of naive scale invariant short-distance expansions involving the axial-vector 
current at the eigenvalue. Since the finite QED program was intended to eliminate 
the pathologies of QED, through presence of an eigenvalue, this showed that its aims 
could not be attained, and again cast strong doubt on the existence of an eigenvalue 
in QED. For later work coming to the same conclusion, and references to more re- 
cent literature, see Baker and Johnson (1979), and Acharya and Narayana Swamy 
(1997). On rereading R30 now, it occurs to me that the argument establishing a 
relation between the axial-vector anomaly and the Schwingcr term given in Section 
III may be extendable to show that the vanishing of anomalies in axial-vector loop 
diagrams coupling to four or more photons in QED implies, through similar use of 
a BJL limit, that the Schwinger term in the two-point function is a c-number. As 
noted in Chapter 3, this is a result that I was unable to prove, before the advent of 
the theory of anomalies, in 1966. For another approach to constraining the structure 
of the Schwinger term, see Jackiw, Van Royen, and West (1970). 



Compactification of Massless QED and Applications 

The fact that the eigenvalue condition for QED can be studied in the conformally 
invariant, massless electron theory, led me to study remappings of the Feynman rules 
for QED that make use of conformal invariance. In Adler (1972b), R31, I showed 
that the equations of motion and Feynman rules for massless Euclidean QED can 
be written in terms of equivalent equations of motion and Feynman rules expressed 
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in terms of coordinates that are confined to the surface of a unit hypersphere in 
5-dimensional space (a four-sphere in mathematical terms). For example, letting ry" 
be the coordinate on the sphere (where a runs from 1 to 5, and {ij"')^ = l), the 
usual four-vector potential is replaced by a five-vector obeying the constraint 
(with repeated indices summed) r/a^l" = 0, and the electromagnetic field strength 
is replaced by a three-index tensor Fabc = LabAc + Lbc^a + LcaAb, with Lab the 
5-space rotation generators. This tensor has a two-index dual F^b, and the Maxwell 
equations become L^bFabc = ^eJc , LabFbc = Fac- The corresponding 0(5) covariant 
Feynman rules are given in Table I of R31. The result of this transformation of the 
theory is an explicit demonstration that massless QED can be compactified, so 
that there are only ultraviolet divergences (corresponding to points approaching 
each other on the surface of the sphere, where it becomes tangent to Euclidean 
4-space), but no infrared divergences. The 0(5) rules, however, are not manifestly 
conformal invariant; in a later section of the paper I showed that they are related, by 
a projective transformation, to a manifestly conformal invariant (but non-compact) 
0(5, 1) formalism that was introduced earlier by Dirac (1936). 

In two subsequent papers I further developed and applied the 0(5) covariant 
formalism. In Adler (1973) I showed that the usual Feynman path integral takes 
the form of an amplitude integral, constructed as an infinite product of individu- 
ally well-defined ordinary integrals over coefficients appearing in the hyperspherical 
harmonic expansion of the electromagnetic potential Aa- In the paper Adler (1974), 
R32, I used the amplitude integral formalism to study a simple model, in which only 
a single photon mode of the form A^ oc viaTj ■ V2 ~ V2ar] • f 1, with vi^2 Orthogonal unit 
vectors, is retained. The external field Fredholni determinant or vacuum persistence 
amplitude A(e^) = det(i7 ■ d + e'^ ■ A) could then be studied by exploiting the 
0(3) X 0(2) residual symmetry of this model, which permits the external field prob- 
lem to be reduced to a set of two coupled first order ordinary differential equations, 
with a Wronskian equal to the Fredholm determinant. A significant result coming 
out of the analysis of this model was that the renormalized Fredholm determinant 
is an entire function of order four as eA becomes infinite in a general complex di- 
rection. This played a role in a subsequent discussion of asymptotic estimates in 
perturbative QED, as discussed in the paper of Balian, Itzykson, Zuber, and Parisi 
(1978), which followed up on an earlier paper of Itzykson, Parisi, and Zuber (1977). 
Whereas extrapolation from the solvable case of a constant field strength F^y sug- 
gested that the order of the Fredholm determinant is two, my solvable example 
showed that two cannot be the correct answer for general vector potentials. Balian 
et al. noted this and then went on to present further arguments for the determinant 
being of order four in four-dimensional spacetime, or more generally of order D in 
D-dimensional spacetime. This in turn had important implications for their study 
of asymptotic behavior of the perturbation series in QED. The subject of the or- 
der of the Fredholm determinant was further developed by Bogomolny. In an initial 
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paper by Bogomolny and Fatcycv (1978), the case of fields with an 0(3) x 0(2) 
symmetry group that I had initiated in R32 was taken up again, and an asymptotic 
formula for the Fredholm determinant was obtained. In a subsequent paper, Bogo- 
molny (1979) showed that this asymptotic formula, and a similar formula obtained 
by Balian et al. for another special case, could be extended to the general result 
limg^joo A(eA) = (e^/127r^) J (i'^x((^^)^) , provided is chosen to obey the non- 
linear gauge condition 9^(^^^^) = 0. Thus the order four result that I found in my 
"one-mode" model in fact gave the correct general answer for QED. 

A further application of the 0(5) formalism for QED emerged after the discov- 
ery of the instanton solution to the Yang-Mills field equations. Jackiw and Rebbi 
(1976) showed that the one-instanton solution is invariant under an 0(5) subgroup 
of the full conformal group, and hence can be rewritten in an elegant way in terms of 
the 0(5) formulation of electrodynamics, as extended to non-Abelian gauge fields. 
Letting aa be the 0(5) equivalent of the Dirac 7 matrices, and "jab = ( V^) [o^a > ctfc] > 
a matrix-valued vector potential Aa obeying the constraint rj ■ A = can be im- 
mediately constructed as Aa = Crjb'jab- Jackiw and Rebbi showed that when this 
vector potential is substituted into the Yang-Mills field equation as expressed in 
the non-Abelian extension of the 0(5) formalism, one gets a cubic equation for the 
coefficient C, two roots of which give pure gauge potentials with vanishing field 
strengths, but the third root of which gives the instanton! Thus, I had missed a sig- 
nificant opportunity in not pursuing the question, raised at least once when I gave 
seminars, of what the non-Abelian generalization of the 0(5) formalism was like. 
A variant of the non-Abelian 0(5) formalism was subsequently applied by Belavin 
and Polyakov (1977), with corrections by Ore (1977), to give a recalculation of the 
Fredholm determinant in an instanton background that was first computed by 't 
Hooft (1976). 
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5. Particle Phenomenology and Neutral Currents 



Introduction 

Much of the work described in Chapters 2 and 3 on soft pion theorems, sum rules, 
anomalies, and neutrino reactions falls in the category of phenomenology, but both 
the interrelations between different aspects of this research, and the chronology, sug- 
gested that it be discussed earlier. Even before this work was done, I wrote my first 
particle phenomenology paper in collaboration with my first year Princeton gradu- 
ate school roommate, and former Harvard classmate, Alfred Goldhaber (Adler and 
Goldhaber, 1963). In this paper we analyzed the possibility of using the deuteron to 
provide a polarized proton target, by determining the polarization of the recoiling 
spectator neutron through its scattering on Hc'^. Although perhaps feasible, this 
proposal was never implemented, and much better methods for directly obtaining 
polarized targets are now available. After I completed the work on quantum elec- 
trodynamics described in Chapter 4, I returned to phenomenology in a number of 
papers written, or conceived, during visits to the National Accelerator Laboratory 
(subsequently renamed the Fermi National Accelerator Laboratory, or Fermilab), 
and continued with related work in a number of papers written at the IAS. I dis- 
cuss the earlier work done at Fermilab in the first section that follows, and then in 
the second section take up work at both Fermilab and the IAS relating to neutral 
currents. 

Visits to Fermilab 

When the National Accelerator Laboratory was inaugurated, my former thesis ad- 
visor Sam Treiman was brought in, on a succession of leaves from Princeton starting 
in 1970, to serve as temporary head of the Theory Group, with the charge of set- 
ting it up and recruiting a permanent head. Subsequently, Ben Lee was hired to be 
the permanent head of the Theory Group. During this period many theorists from 
outside institutions were invited to be term time and/or summer visitors, and as 
part of this program I made a series of visits to Fermilab, and wrote a number of 
phenomenological papers growing out of discussions with people there. 

As already noted in Chapter 3, during a 1971 visit to Fermilab I collaborated 
with Lee, Treiman, and Tony Zee to study the anomaly-based prediction for the pro- 
cess 77 Stt, described in the paper R20. This was applied in a subsequent paper 
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that I wrote with Glcnnys Farrar and Trciman (Adler, Farrar, and Treiman, 1972, 
R33) to an analysis of the contribution of three pion intermediate states to the rare 
kaon decay Kl l^^f^ - The background for this study was what was then called 
the "i^L puzzle", the fact that experiment had not detected this kaon 

decay mode at a level considerably below that given by a iniitarity bound based on 
the assumed dominance of a two photon intermediate state in the absorptive part 
of the decay amplitude. There were thus two possibilities, either an experimental 
problem, or destructive interference with another intermediate state, for which the 
three pion intermediate state was a prime candidate. Aviv and Sawyer (1971) had 
proposed to use soft pion methods to estimate the three pion contribution, and had 
concluded that the contribution was much too small to be relevant. However, the 
Aviv-Sawyer analysis used an expression for the Stt — > 77 amplitude which had 
been shown in R20 to be incorrect. In R33, we estimated the three pion contribution 
by using the corrected Svr 77 amplitude calculated in R20, but still found that 
it gave much too small a contribution to explain the lack of observed Kl A*"*"/^" 
events. Similar conclusions, again using the results of R20, were reached indepen- 
dently by Pratap, Smith, and Uy (1972). Ultimately, the origin of the "Kl — M"*"/^" 
puzzle" turned out to be experimental, and this decay mode has now been seen in 
a number of experiments, with the Particle Data Group giving an average value for 
r(/Lt+/Lt~)/rTOT of ~ 7.2 X 10~^, as compared with the theoretical unitarity lower 
bound of 7.0 x 10~^ based on the current Kl 77 branching ratio. 

During the years 1973-1974, my Fermilab visits led to papers in two separate 
areas, searches for neutral currents in weak pion production, and the analysis of 
what was then a discrepancy between theory and experiment in ^u-mesic atom x-ray 
spectra. I will take up this second area first, because the neutral current work leads 
directly into the papers discussed in the next section. My interest in the jU-mesic 
atom discrepancy was stimtilatcd by my earlier work on quantum electrodynamics, 
since an eigenvahic in QED could show up as deviations from the standard pertur- 
bation theory predictions for vacuum polarization effects. Thinking about tests for 
vacuum polarization discrepancies in QED led me to think more generally about 
other aspects of vacuum polarization, in particular the predictions for the ratio 
R{s) = (j{e^e~ hadrons; s)/o"(e'''e~ h~^ij,~;s) in various models for quark 
structure of hadrons. This offshoot of the QED work led to results that are still 
used today, introduced in the paper Adler (1974a), R34, dealing with "Some simple 
vacuum-polarization phenomenology...". My basic observation was that whereas R 
is measured in the timelike region, the natural place to compare experiment with 
scaling predictions of various theories is in the spacelikc region, where (since there 
are no threshold effects) one might expect an early or "precocious" onset of scaling. 
Rather than directly using the dispersion relation for the vacuum polarization part 
to calculate the spacelike continuation, I proposed using its first derivative, and so 
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defined a function 

^ duRju) 

This function is the one for which parton models and QCD most directly make pre- 
dictions, and since it is positive definite and involves a strongly convergent integral 
(for R approaching a constant), the experimentally inaccessible high energy tail has 
a known sign and a magnitude that can be bounded. For a parton model in which 
R asymptotically approaches a constant C, one has T{—s) ~C/sass— >-oo, and a 
similar formula holds in QCD with a known logarithmic correction. The paper R34 
used the function T{—s) to propose a test of the colored quark hypothesis. Subse- 
quently, De Riijula and Georgi (1976) used a modified version of this idea, defining 
D{s) = sT{—s), to analyze the new SPEAR data. They found that the original 
colored quark model was excluded, and among various viable possibilities, noted 
that "the standard model with charm is acceptable if heavy Icptons are produced," 
a conclusion that was borne out by experiment with the subsequent discovery of 
the r lepton. Shortly afterwards, Poggio, Quinn, and Weinberg (1976) proposed a 
generalized method in which the derivative of the hadronic vacuum polarization 
that I had used is replaced by a finite difference between the hadronic vacuum po- 
larization values at points a distance ibiA from the timclikc real axis, leading to 
a "smeared" average of R{s) that retains sensitivity to threshold effects. Recently, 
my original method, generally in the form D{s) used by De Riijula and Georgi, has 
been revived under the name of the "Adler function", in a number of papers; see, 
for example, Broadhurst and Kataev (1993), Kataev (1996); Peris, Perrottet, and de 
Rafael (1998); Beneke (1999); Eidelman, Jegerlehner, Kataev, and Veretin (1999); 
Kataev (1999); Cvetic, Lee, and Schmidt (2001); Cvetic, Dib, Lee, and Schmidt 
(2001); Milton, Solovtsov, and Solovtsova (2001); and Dorokhov (2004). 

In the second part of R34 I examined what was then a discrepancy between theory 
and experiment in //-mesic atom x-ray transition energies, under the assumption 
that (if real) the discrepancy arose from a nonperturbative correction 6p to the 
vacuum polarization absorptive part. Assuming that dp is positive, or positive and 
monotonic, I derived lower bounds on the corresponding deviation that would be 
expected in = ^{g^ — 2). For instance, if 6p is assumed positive and monotonic, 
comparison of the kernels that weight p in the formulas for the x-ray transition 
energies and for gives the bound (5a^ < —(0.98 it 0.18) x 10~^. In a follow-up 
paper with Roger Dashen and Sam Treiman (Adler, Dashen, and Treiman, 1974) 
we discussed other tests for a nonperturbative vacuum polarization contribution, 
and also placed bounds on the mass of a light scalar meson that could be invoked 
to explain the x-ray discrepancy. A few months later, Barbieri (1975) extended 
the method of R34 to show that precision measurements of the (/x^He)"^ system 
were already at variance, within the vacuum polarization deviation or scalar meson 
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exchange hypotheses, with the supposed x-ray discrepancy. A later paper of Barbieri 
and Ericson (1975) gave additional evidence against the scalar meson explanation 
for the x-ray discrepancy. In the meantime, during 1975 and the few years following, 
there were a number of experimental developments, reviewed in detail in Borie and 
Rinkcr (1982), as a result of which the muonic x-ray discrepancy was eliminated. 
Incidentally, the current theoretical and experimental values of differ by a few 
parts in 10~^, well below the lower bounds on (5a^ inferred in R34 from the //-mesic 
atom x-ray data at that time, giving an additional indication that that the purported 
x-ray discrepancy was an experimental artifact. 



Neutral Currents 

The existence of weak neutral currents is a principal prediction of the Glashow- 
Weinberg-Salam electrowcak theory, and commanded much attention in the 1970s. 
Failure to find weak neutral currents would have falsified the electrowcak theory, 
and on the other hand, detection of weak neutral currents would give a value for 
the electroweak mixing angle 6w , which in turn determines the masses of the heavy 
intermediate bosons of the theory. As a result of my thesis work on weak pion 
production, it was natural for me to get interested in theoretical estimates of the 
neutral current weak pion production channels v + Ni ^ i' + tt + Nf, with Ni^ f a 
nucleon (either a neutron or proton) and with vr a pion of appropriate charge. In 
July 1972, a collaboration with Wonyong Lee as spokesman proposed a study of 
weak neutral currents in both the purely leptonic and the pion production channels 
at the Brookhavcn AGS accelerator, and a copy of their proposal is in my files. 
Through this, and through related correspondence of Ben Lee with Sam Treiman, I 
got interested in doing detailed calculations for this process, and over the next few 
years was in frequent touch with the experimental group for which Wonyong Lee 
was spokesman. 

My initial papers were motivated by the fact that preliminary estimates of neu- 
tral current weak pion production by Ben Lee (1972) appeared to conflict with 
experiments in complex nuclei reported by Wonyong Lee (1972), subject to two 
caveats. The first caveat was that Ben Lee's static model estimates didn't include 
1=1/2 contributions to weak pion production, and the second caveat was that nu- 
clear charge exchange corrections could be important, as noted by Perkins (1972). 
The first of these issues was dealt with in a short paper Adler (1974b), R35, where I 
used my model of R15, as adapted to the neutral current case, to estimate the effects 
of including the nonresonant isospin 1/2 channels, and concluded that they had lit- 
tle effect on Ben Lee's estimate from the dominant isospin 3/2 channel. The second 
issue was dealt with in a paper on nuclear charge exchange corrections to pion pro- 
duction in the A(1232) region, that I wrote in collaboration with Shmuel Nussinov 
and Emmanuel Paschos (Adler, Nussinov, and Paschos, 1974, R36). In this paper. 
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we estimated the effects of multiple charge exchange scattering on pion production 
in nuclear targets, using an extension of techniques used by Fermi and others to 
calculate multiple neutron scattering in the early days of neutron physics. A con- 
siderable part of the fun of writing this paper was learning about this older work on 
neutron physics, and feeling a sense of continuity between current concerns of weak 
interaction physics and the quite differently motivated work of an earlier generation. 
In addition to giving analytic formulas, we tabulated various results for the case of a 
laAl^^ target, as appropriate to experiments with aluminum spark chamber plates. 
In R36, we made the simplifying assumption of an isotopically neutral target (that 
is, equal numbers of neutrons and protons), which is exact for qC^^, and a good 
approximation for aluminum. In a follow up paper (Adler, 1974c), I extended the 
model to nuclear targets with a neutron excess. As can be seen from Table II of R36, 
charge exchange corrections are sizable, and in our model typically reduce the ratio 
of neutral current to charged current tt^ production by about 40%. 

My next paper on neutral currents was motivated by the fact that preliminary 
results of an experiment on weak pion production in hydrogen at Argonne National 
Laboratory showed a cluster of neutral current events just above threshold. In this 
kinematic regime soft pion methods should apply, allowing one to relate threshold 
neutral current weak pion production in the standard electrowcak theory to the 
elastic neutral current cross section loi v + p v + p. Using this relation, I showed 
in Adler (1974d), R37 that one could place bounds on the expected number of 
neutral current pion production events in the threshold region, with the Argonne 
results exceeding these bounds. Thus, there seemed to be stronger neutral current 
weak pion production than suggested by the SU{2) x U{1) electrowcak theory. 

Subsequent events then proceeded on several parallel tracks. In a follow-up paper 
to R37, published as Adler (1975a), R38, I used the full apparatus of my weak pion 
production calculation of R15 to extend the neutral current calculation above the 
threshold region to include the regime where A (1232) production dominates. This 
analysis reinforced the conclusions about the preliminary Argonne data already 
reached in R37. Simultaneously, with a large group of postdocs at the Institute, I 
embarked on a study of weak pion production in alternative models of neutral cur- 
rents with scalar, pseudoscalar, and tensor currents, and also with so-called "second 
class" (abnormal G-parity) currents. Additionally, in Adler, Karliner, Lieberman, 
Ng, and Tsao (1976), we did a detailed study of isospin-1/2 resonance production 
by V,A neutral currents. Perhaps the one part of the group effort on alternative 
current structures to have lasting value was a calculation of nucleon to nucleon and 
pion to pion matrix elements of scalar, pseudoscalar, and tensor current densities, 
using all the theoretical tools then at our disposal: flavor SU^, and chiral SU^ x SU^ 
symmetries, the quark model, and the MIT "bag" model. The results of these cal- 
culations were checked by several of us, and tabulated in Adler, Colglazier, Healy, 
Karliner, Lieberman, Ng, and Tsao (1975), R39; they were subsequently relevant for 
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estimates of the coupling to nucleons of hypothetical scalar and pseudoscalar par- 
ticles, such as axions. The main part of the group effort was a current algebra soft 
pion production calculation for the alternative current case, which involved exten- 
sive algebra and computer work. Prom this, we found that one could explain roughly 
half of the reported Argonne threshold events with currents of scalar, pseudoscalar, 
and tensor type, by allowing some deviations from the matrix element estimates of 
R39, as I reported at the January, 1975 Coral Gables Conference (Adler, 1975b). 
In the meantime, the Argonne group reexamined possible background problems af- 
fecting their preliminary results, with the result that they ultimately discounted the 
cluster of pion production events near threshold. So by September of 1975, when I 
reviewed the subject of gauge theories and neutrino interactions at a conference at 
Northeastern University (Adler, 1976a), the electroweak theory predictions for neu- 
tral current weak pion production, following from purely V and A currents, were no 
longer in conflict with experiment. This conclusion was reinforced by a subsequent 
detailed analysis by Monsay (1978) of neutral current weak pion production, using 
my model together with the charge exchange corrections of R36. 

In the summer of 1975 I lectured on neutrino interactions and neutral currents 
at the Sixth Hawaii Topical Conference on Particle Physics, and gave a compre- 
hensive survey of neutral current phenomenology based on parton model methods, 
soft pion theorems, and quark model calculations of baryon static properties. This 
appeared both in the conference proceedings (Adler, 1976b) and again in a tenth 
year anniversary volume selecting highlights from the preceding summer schools 
(Pakvasa and Tuan, 1982). My hope in preparing the 1975 lectures was that survey- 
ing all available tools would hasten the day when one could determine electroweak 
parameters based on using all available data for a global fit, instead of doing piece- 
meal fits channel-by-channel. Such a global fit was carried out a few years later 
by Abbott and Barnctt (1978a,b), who included four types of data: deep inelastic 
neutrino scattering uN uX, elastic neutrino-proton scattering vp up, neu- 
trino induced inclusive pion production fN vttX, and neutrino induced exclusive 
pion production vN — > uttN. For the exclusive pion process, they employed my 
weak pion production calculation of R15 as extended to neutral currents in R38, 
using test data that I ran for them from my programs as benchmarks to help debug 
their programming. Their results were, in the words of their letter Abstract, "for 
the first time, a unique determination of the weak neutral-current couplings of u 
and d quarks. Data for exclusive pion production are a crucial new input in this 
analysis." Their multi-channel fit gave the first full confirmation that the Glashow- 
Weinberg-Salam model, with sin^ 9\y between 0.22 and 0.30, was in agreement with 
the experimental up and down quark neutral current coupling parameters. To me, 
the Abbott-Barnett analysis was valued recompense for the several years of hard 
calculation and scholarly attention to detail that I had put into the subject of weak 
pion production. 
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6. Gravitation 



Introduction 

During the first half of the 1970's, I started to get interested in learning more about 
gravitational physics. When I was a graduate student at Princeton in the early 
1960's, particle physics and gravitational physics were quite separate subjects, with 
the former the domain of Goldbcrgcr and Trciman, and the latter the domain of 
Wheeler and Dickc, to mention just a few key faculty members. Under the un- 
structured system at Princeton, I never took a course in gravitation, and for my 
general exam got by with the introduction to general relativity that I obtained by 
reading the text of Peter Bergmann (1942), as well as reading some of the original 
Einstein papers reprinted in a Dover edition. (Working through the Dover volume 
was a project of an informal reading and discussion grotip during my senior year 
at Harvard, organized by Norval Fortson, an experimental physics graduate student 
affiliated with the residential house where I lived then.) However, in the 1970's it 
became clear both that many new results had been obtained in general relativity, 
so that my undergraduate knowledge was out-of-date, and that general relativity 
was becoming part of the essential tool kit of people working in quantum field the- 
ory. Among the things that convinced me of this were reading the thesis of Stephen 
Fulling (1972) on scalar quantum field theory in de Sitter space, while I was working 
on the 0(5) formulation of QED, the work of 't Hooft and Veltman (1974) and Deser 
(1975) on one-loop divergences of quantum gravity, and the availability of the new 
books on gravitation of Weinberg (1972), Misner, Thorne, and Wheeler (1973), and 
Hawking and Ellis (1973). 

My intention in writing my comprehensive Hawaii lectures in the summer of 
1975 was to wind up my involvement with neutrino physics, so that I could turn 
to something new. Since in 1976 I was due for a sabbatical, and my family did not 
want to travel away from Princeton, I decided that to learn relativity I would take a 
"reverse sabbatical" , by going to Princeton University to teach the relativity course 
for a year. So I spent my evenings during the 1975-1976 academic year reading the 
texts of Weinberg and of Misner, Thorne, and Wheeler, and then took my sabbatical 
during the 1976-1977 academic year, teaching both the fall term course in Special 
Relativity and the spring term continuation course in General Relativity. I also was 
the faculty advisor for John David Crawford, who did a senior thesis on experimental 
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tests for curvature squared additions to the gravitational action. With this reading 
and teaching as background, I embarked on a number of relativity-related research 
projects, described in the next two sections. 



First Papers 

My first papers on gravity were the working out of a very speculative idea, that grav- 
itation might be a composite phenomenon, with the gravitational fields arising as 
composite "pairing" amplitudes of photons in analogy with the energy gap order pa- 
rameter for superconductivity. In the paper Adler, Lieberman, Ng, and Tsao (1976), 
we looked for weak coupling singularities in the electromagnetic photon-photon lad- 
der graph sum in a conformally flat spacetime, and found some resemblances to 
the helicity structure of graviton exchange amplitudes. In a follow-up paper (Adler, 
1976) I gave a linearized Hartree formulation for the photon pairing problem in a 
general background metric. I was never able to establish a detailed connection be- 
tween photon pairing amplitudes and graviton couplings in the general case, and the 
fact that no weak coupling singularities occurred in flat spacetime meant that one 
could not establish a connection with the standard results of linearized general rela- 
tivity. In retrospect, the absence of pairing effects in flat spacetime could have been 
expected from a subsequent theorem of Weinberg and Witten (1980), that ruled 
out spin-2 composites under quite general assumptions, and effectively doomed the 
program as set up in the 1976 papers. However, a useful outcome of writing these 
papers was that it started me thinking more generally about the idea of gravitation 
as an effective theory, and in particular about Sakharov's ideas on gravitation, which 
I briefly discussed in the paper Adler (1976); following up this direction later on led 
to my work on the Einstein action as a symmetry-breaking effect, discussed in the 
next section. 

A second topic that I worked on in 1976 was the regularization of the stress- 
energy tensor for particles propagating in a general background metric. In the paper 
Adler, Lieberman, and Ng (1977), we applied covariant point-splitting techniques to 
the Hadamard series for the Green's functions, which we used to define a regular- 
ized stress-energy tensor for vector and scalar particles. This was a very technical 
computation, and contained useful formulas among its results, but also produced an 
embarrassment: by our method of regularization, we did not find the trace anomaly 
that had been found by others using different methods. We rechecked our calcula- 
tion carefully, but could not flnd the source of the discrepancy. The problem was 
finally resolved by Wald (1978) (in time to be described in a note added in proof 
to our 1977 paper). Wald had earlier (Wald, 1977) set up a general axiomatization 
for the stress-energy tensor, and in Wald (1978) had shown that it leads to an es- 
sentially unique result. Applying a point-separation method similar to ours, he had 
also found no trace anomaly, but then went on to note that there was a subtle error 
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in our analysis. Wc had assumed that the local and boundary-condition-dependent 
parts of the Hadamard solution are separately symmetric in their arguments, but 
this is in fact not the case; only their sum is symmetric. Wald (1978) showed in the 
scalar case that when the analysis is repeated without the incorrect assumption, one 
gets the standard trace anomaly. Judy Lieberman and I then did the corresponding 
calculation in the vector particle case (Adlcr and Lieberman, 1978, R40), again find- 
ing that when the asymmetry of the two pieces of the Hadamard solution is taken 
into account, one gets the correct trace anomaly. 

In a lunchtime conversation at some point during the 1977-1978 academic year, 
Robert Pearson asked whether the "no-hair" theorems of general relativity applied 
to the case of spontaneous symmetry breaking. I thought this was interesting and 
looked into it, finding no relevant papers in the literature. This became the subject 
of a joint paper (Adler and Pearson, 1978, R41), which showed that the standard 
"no-hair" theorems generalize to the vector field in the Abelian Higgs model, and 
to the non-conformally invariant Goldstone scalar field model. In our paper, we 
restricted ourselves to static, spherically symmetric black holes, and made the phys- 
ically motivated assumption that any "hair" would also be static and spherically 
symmetric. This permits a simplifying choice of gauge for the Abelian Higgs model 
introduced by Bckcnstein (1972). He observes that static electric charge distribu- 
tions must give rise to static electric fields and vanishing magnetic fields. Thus one 
can find a special gauge in which the potentials obey ^ = 0, dAo/dt = 0. Since 
the gauge-independent source current obeys similar conditions j = 0, djo/dt = 0, 
and since the gauge-independent magnitude of the Higgs scalar field is static, one 
finds that the residual phase of the Higgs scalar field in the special gauge is a space- 
independent, linear function of time, which can be eliminated by a further gauge 
transformation that preserves the gauge conditions A = 0, dAo/dt = 0. Thus one 
can do the analysis of possible "hair" taking the vector potential to be zero, and 
the Abelian Higgs field to be real. I have described Bckenstein's argument here in 
some detail because the choice of gauge in R41 is the basis of rather loosely worded 
objections to our paper in lectures of Gibbons (1990); his assertion (and that of 
authors who have quoted his lectures) that the gauge choice is problematic is not 
correct, as working through the Bekenstein argument given above makes clear. Also, 
I have rechecked the proof given in R41, and apart from the minor problem found 
by Ray, as discussed below, I find that the proof is correct, in disagreement with 
further statements in Gibbons' lecture. However, in response to Gibbons' comments 
about our choice of gauge, proofs of the "no-hair" theorem for the Abelian Higgs 
model that do not use a special gauge choice have since been given by Lahiri (1993) 
and by Ayon-Beato (2000). 

Our argument starting from Eq. (24) of R41 was subsequently considerably sim- 
plified, and in the case when d6/dX\H = corrected, in a paper of Ray (1979). (The 
subscript H here refers to evaluation at the horizon; see R41 for details of this and 



February 2, 2008 2;58 Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics archive4 



Gravitation 81 

other notation used in the following discussion of Ray's paper.) The minor problem 
noted by Ray resulted from our not dropping the subdominant term d9/d\ on the 
right-hand side of Eq. (31) when integrating this equation to get Eq. (33), so as 
to be consistent with our dropping this term elsewhere, such as in Eq. (32). When 
this term is dropped, the 6*"^/^ factor in Eq. (33) is replaced by a constant, and the 
approximate solution of Eq. (33) agrees with the exact solution of Eq. (24) given by 
Ray. As Ray points out, with this correction one still finds that q~^(fP' is infinite at 
the horizon unless K = 0, which is what is needed to complete the proof. 

Finally, I note that the subject of black hole "hair" in gauge theories has taken 
on new interest recently with the discovery that topological charges on a black hole 
can give nonzero effects outside the horizon; see, for example, Coleman, Preskill, 
and Wilczek (1992) and the related lectures of Wilczek (1998). 



Einstein Gravity as a Symmetry Breaking Effect 

In late January of 1978 I organized a small conference on "Geometry, Gravity and 
Field Theory" for the EST Foundation in San Francisco; this was a memorable 
event that was attended by a large fraction of the leading people with interests 
in quantum gravity. During my plane travel for this conference, and afterwards, 

1 started to think about the confinement problem in QCD, and this became the 
main focus of my research for the next two years, as described in the following 
chapter. However, learning about scale breaking in QCD also led me back into 
gravitational physics, through considering the role similar mechanisms might play 
in giving a quantitative form to the suggestion by Sakharov (1968) (see also Klein, 
1974) that Einstein gravity is the "metric elasticity" of spacetime. I did not arrive at 
the correct formulation immediately; I find in my files two unpublished manuscripts, 
the first positing monopole boundary conditions, and the second positing dimension- 

2 operators, as a source for symmetry breaking, in both cases suggesting connections 
with the Einstein-Hilbert action. I went as far as submitting a manuscript based on 
the second for publication, and also gave a seminar on it at Princeton University, 
where my arguments were torn to shreds by David Gross (following which I withdrew 
the manuscript). The criticism proved useful; I went home, learned more about 
dimensional transmutation and the theory of calculability versus renormalizability, 
and came up with the correct formulation given in Adler (1980a), R42. The basic 
idea here is that in theories which contain no scalars, so that scale invariance is 
spontaneously broken (QCD is a prime example, but "technicolor" type unification 
models also fit this description) , there will be an induced order R term in the action 
in a curved background, with a coefficient that is calculable in terms of the scale 
mass of the theory. Thus, if an underlying unified theory spontaneously breaks scale 
invariance at the Planck scale, one can induce the Einstein gravitational action as a 



February 2, 2008 2;58 



Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics 



82 



Adventures in Theoretical Physics 



scale-symmetry breaking effect, giving an explicit realization of the Sakharov-Klein 
idea. 

I followed up this paper with a second one (Adler, 1980b, R43) in which I gave an 
explicit formula for the "induced gravitational constant" in theories with dynamical 
breakdown of scale invariancc, expressed in terms of the vacuum expectation of the 
autocorrelation function of the trace of the renormalized stress-energy tensor T^i,, 



This formula for the induced Newton constant was independently obtained at about 
the same time by Zee (1981), and in the subsequent literature, the term "induced 
gravity" has come to be frequently used to describe the whole set of ideas involved. 
These papers attracted considerable attention in the gravity community, one result 
of which was that Claudio Teitelboim and his colleagues at the University of Texas 
in Austin invited me to give the Schild lectures in April of 1981. (My four lectures 
over a two week period, entitled "Einstein Gravity as a Symmetry-Breaking Effect 
in Quantum Field Theory", were the eleventh in the Schild series.) This proved 
memorable for an unanticipated reason; shortly before I was to go to Texas I con- 
tracted a mild case of what was probably type-A hepatitis (the kind transmitted 
by shellfish), and so was sick in bed with very little energy. I dragged myself out of 
bed on alternate days to write lecture notes, and then was so tired I had to sleep 
the entire day following. At any rate, I improved enough so that my doctor gave 
me permission to go to Texas, where Philip Candelas took me into his home and 
helped me get through my scheduled lectures. Ultimately, I expanded the lectures 
into a much-cited comprehensive article that appeared in Reviews of Modern Physics 
(Adler, 1982, R44). A year later, I wrote a briefer synopsis of the program of gener- 
ating the Einstein action as an effective field theory, for a Royal Society conference 
on "The Constants of Physics", which was published as Adler (1983). 

The explicit formula for the induced gravitational constant raises a number of 
interesting issues. First of all, if one assumes an unsubtracted dispersion relation 
for the Fourier transform 'i/'(9^) of the autocorrelation function of the stress-energy 
tensor trace, the induced gravitational constant is negative. However, as shown by 
Khuri (1982a) using analyticity methods, in asymptotically free theories there are 
three possible cases, depending on the distribution of zeros of ip{q'^), and in one of 
these cases Gind has positive sign. In further papers Khuri (1982b,c) showed that 
in this case one can place useful bounds on the induced gravitational constant, 
expressed in terms of the scale mass of the theory. 

The question of whether the formula for the induced gravitational constant gives 
a unique answer has been discussed, from the point of view infrared renormalon sin- 
gularities, by David (1984) and in a follow-up paper of David and Strominger (1984). 
These authors argue that renormalons introduce an arbitrariness into the calculation 
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of Gind) SlS manifested through the fact that in the dimensional regularization of the 
ultraviolet singular "comparison function" ^c(^) introduced in Eq. (5.48) of R44, 
one has to continue onto a cut. In Appendix B, Section 3 of R44, I used a principal 
value prescription to deal with this, which David argues can be modified by taking 
complex weightings of the upper and lower sides of the branch cut, allowing a free 
parameter multiple of the imaginary part to be introduced into the calculation of 
the integral over the comparison function. David argues that this means that the 
expression for Gind has an inherent ambiguity. I believe that this conclusion is sus- 
pect; since QCD and similar theories that spontaneously generate a mass scale arc 
believed to be consistent field theories, their curved spacetime embeddings should, 
by the equivalence principle, also be consistent theories. This strongly suggests that 
the coefficient of the order R term in a curvature expansion of the vacuum action 
functional should be well defined, and that the ambiguity is an artifact of the com- 
parison function procedure. This view is supported by the review article of Beneke 
(1999) on renormalons, where it is argued that renormalon ambiguities are typically 
canceled by corresponding ambiguities in non-perturbative terms (such as the inte- 
gral AIuv with integrand * — ^'c(i) in Eq. (5.48)) , giving total physical amplitudes 
that are unambiguous. In other words, the renormalon ambiguities are an artifact of 
an attempted separation of QCD physical amplitudes into a "perturbative" and a 
"non-perturbative" part, and only indicate that if a branching prescription (such as 
a principal value) is needed for the perturbative part, then a corresponding branch- 
ing prescription is also needed for the non-perturbative part. This will make the 
calculation of quantities like Gjnd difficult, but does not imply that the calculation 
cannot, in principle, give a unique, physical answer. In the paper of David and Stro- 
minger (1984), the authors show that Gmd is unambiguous in finite supcrsymmetric 
theories, giving an existence proof that there are theories with a finite induced New- 
ton's constant. In the general case, they acknowledge that "there is no proof that 
Gind will necessarily be ambiguous" , and I suspect that in fact Gind will turn out to 
be well defined in a much wider class of supcrsymmetric and non-sup ersymmetric 
theories than only finite ones. Clearly, this is a question that merits further study. 

If one thinks more generally about the structure of a fundamental theory of 
gravitation, there are a number of possibilities. It may be that the Planck length is 
the minimum length scale possible, because of an underlying "graininess" of space- 
time. Or spacetime may be a continuum, as generally assumed, in which case the 
Planck length plays the role of the scale at which a classical metric breaks down, 
with new dynamical principles taking over at shorter distances. The suggestion that 
the order R gravitational action is an expression of scale symmetry breaking in a 
more fundamental scale-invariant theory is clearly based on a continuum picture 
of spacetime. A continuum assumption is also made in string theories, which how- 
ever are not scale-invariant; in string theories a fundamental length scale (the string 
tension) appears in the action, and this directly sets the scale for the gravitational 
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action. Should spacetime turn out to be discrete or grainy, there may be more gen- 
eral forms of the induced gravitation idea that are relevant. Ultimately, the origin of 
the spacetime metric, and of the Einstein-Hilbert gravitational action that governs 
its dynamics, will not be certain until we have a unifying theory that also resolves 
the cosmological constant problem, which is not dealt with in any of the current 
ideas about quantum gravity. 
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7. Non-Abelian Monopoles, Confinement Models, 
and Chiral Symmetry Breciking 



Introduction 

The somewhat disparate topics to be discussed in this chapter are all connected 
through my interest during the late 1970's and early 1980's in studying nonper- 

turbative properties of quantum chromodynamics (QCD), the theory of the strong 
interactions. I began these investigations by looking for a semi-classical model for 
heavy quark confinement. My first idea, that quarks might be confined in a non- 
Abelian monopole background field, did not work, but led to interesting progress 
in the theory of monopoles, as described in the first section. Most significantly, as 
discussed in detail, the monopole work led indirectly to the completion by Clifford 
Taubes of his multimonopole existence theorem during a visit to the IAS in the 
spring of 1980. I then turned to models based on the nonlinear dielectric properties 
of the QCD vacuum, which led to the confinement of quarks in "bag" -like structures 
which yield good heavy quark static potentials, as discussed in the second section. 
Finally, at the end of this period I worked briefly on the spontaneous breaking of 
chiral symmetry in QCD within the framework of pairing models patterned after 
superconductivity, as discussed in the final section. All three of these aspects of my 
study of QCD involved heavy numerical work, which in turn led to my interest in 
algorithms discussed in the next chapter. 

Non-Abelian Monopoles 

My first attempt at the confinement problem, which did not succeed but which had 
useful by-products that I shall describe here, was based on the idea of considering 
the potential between classical quark sources in the background of a non-Abelian 
't Hooft (1974a)-Polyakov (1974)-Prasad-Sommerfield (1975)~Bogomornyi (1976) 
monopole or its generalizations, which I conjectured in Adler (1978b), R45 might act 
as a quark-confining "bag". To justify considering classical quark sources, I initially 
resorted to a scheme (Adler, 1978a) that I called "algebraic chromodynamics", which 
involved looking at the color space spanning the direct product of independent 
color charge matrices. However, I eventually dropped this apparatus in my pursuit 
of the confinement problem, and used instead the popular approximation of color 
charge matrices lying in a maximal Abelian subgroup of the SU (3) color group of 
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conventional QCD, which gives a good first approximation to the full QCD color 
structure. Since it is clear that source charges in classical Yang-Mills theory are not 
confined, I looked for a simple modification of this theory that might lead to a linear 
potential. The first idea I tried was to look at classical Yang-Mills charges in the 
field of a background monopolc. This had the obvious problem that the monopole 
scale has no clear relation to the QCD scale set by dimensional transmutation, but 
I simply ignored this difficulty and plunged ahead. 

To pursue (and ultimately rule out) the conjecture that a monopole background 
would confine, I did a number of calculations of properties of monopole solutions. 
The first was a calculation of the Green's function for a single Prasad-Sommerfield 
monopole, by using the multi-instanton representation of the monopole and a formal- 
ism for calculating multi-instanton Green's functions given by Brown et al. (1978). 
This calculation was spread over two papers that I wrote; setting up contour integral 
expressions for the Green's function was done in Appendix A of Adler (1978b), R45, 
and the final result for the monopole propagator, after evaluation of the contour in- 
tegrals and considerable algebraic simplification, was given in Appendix A of Adler 
(1979a), R46. (The fact that many lengthy expressions for parts of the Green's func- 
tion collapsed, after algebraic rearrangement, into simple formtilas, stiggcstcd that 
there should be a more efficient way to find the monopole Green's function. Not 
long afterwards, Nahm (1980) gave a new representation for the monopole that per- 
mitted a much simpler calculation of the Green's function given in R46.) To check 
that the lengthy expression that I had obtained for the propagator really satisfied 
the differential equation for the Green's function, I used numerical methods, calcu- 
lating the partial derivatives acting on the propagator by finite difference methods 
on a very fine mesh. From numerical calculations based on the propagator formula, 
it was clear that a single monopole background would not lead to confinement; all 
that happened was that a Coulombic attractive — 1/r potential was reversed into 
a repulsive 1/r potential for large quark separations, a result that could have been 
anticipated from the large distance structure of the monopole field. 

Not yet ready to give up on the monopole background idea, I then wrote two 
papers speculating that the Prasad-Sommerfield monopole might be a member of a 
larger class of solutions, in which the point at which the monopole Higgs field van- 
ishes is extended to a higher-dimensional region, and in particular to a "string" -like 
configuration with a line segment as a zero set. In the first of these papers (Adler 
1979b) I studied small deformations around the Prasad-Sommerfield monopole and 
found several series of such deformations. For normalized deformations I recovered 
the monopole zero modes that had already been obtained by Mottola (1978, 1979), 
but I found that "if an axially symmetric extension exists, it cannot be reached 
by integration out along a tangent vector defined by a nonvanishing, non-singular 
small-perturbation mode" . This work was later extended into a complete calculation 
of the perturbations around the Prasad-Sommerfield solution by Akhoury, Jun, and 
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Goldhaber (1980), who also found "no acceptable nontrivial zero energy modes." 
In my second paper, Adler (1979c), I employed nonperturbative methods and sug- 
gested that despite the negative perturbative results, there might still be interesting 
extensions of the Prasad-Sommerfield solution with extended Higgs field zero sets. 

At just around the same time, Erick Weinberg wrote a paper (Weinberg, 1979b) 
extending an index theorem of Callias (1978) to give a parameter counting theorem 
for multi-monopole solutions. Weinberg concluded that "any solution with n units of 
magnetic charge belongs to a (4n — l)-parameter family of solutions. It is conjectured 
that these parameters correspond to the positions and relative U{1) orientations of 
n noninteracting unit monopoles". For n = I, his results agreed with the zero-mode 
counting implied by Mottola's explicit calculation. Weinberg and I were aware of 
each other's work, as evidenced by correspondence in my file dating from March 
to June of 1979, and references relating to this correspondence in our papers Adler 
(1979c) and Weinberg (1979b). 

My contact with Clifford Taubes was initiated by an April, 1979 letter from 
Arthur Jaffe, after I gave a talk at Harvard while Jaffe, as it happened, was visiting 
Princeton! In his letter, Jaffe noted that I was working on problems similar to those 
on which his students were working, and enclosed a copy of a paper by Clifford 
Taubes. (This preprint was not filed with Jaffe's letter, so I am not sure which of 
the early Taubes papers listed on the SLAC Spires archive that it was.) Jaffe's let- 
ter initiated telephone contacts with Taubes and some correspondence from him. 
On Jan. 6, 1980 Taubes wrote to me that he was making progress in proving the 
existence of multi-monopole Prasad-Sommerfield solutions, and in this letter and a 
second one dated on January 18, 1980 he reported results that were relevant to my 
conjectures on the possibility of deformed monopoles. His results placed significant 
restrictions on my conjectures; in a letter dated Feb. 1, 1980 I wrote to Lochlainn 
O'Raifeartaigh, who had also been interested in axially symmetric monopoles, say- 
ing that "On thinking some more about your paper (O'Raifcartaigh's preprint was 
unfortunately not retained in my files) I realized that the enclosed argument by 
Cliff Taubes is strong evidence against n = 2 monopoles involving a line zero. What 
Taubes shows is that a finite action solution of the Yang-Mills-Higgs Lagrangian 
cannot have a line zero of arbitrarily great length; hence if n = 2 monopoles con- 
tained a line zero joining the monopole centers, the monopole separation would be 
bounded from above. But this seems unlikely...". This correspondence and the re- 
sult of Taubes was mentioned at the end of the published version, Houston and 
O'Raifeartaigh (1980). 

As a result of our overlapping interests, I arranged for Taubes to make an informal 
visit, of two or three months, to the IAS during the spring of 1980. Clifford had 
expressed interest in this, he noted in a recent email, in part because Raoul Bott 
had suggested that he visit the Institute to get acquainted with Karen Uhlenbeck, 
who was visiting the IAS that year. In the course of his visit he met and interacted 
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with Uhlcnbcck, who, along with Bott, had a major impact on his development as 
a mathematician. 

Taubes began the visit by looking at my conjecture of extended zero sets, but 
after a while told me that he could not find an argument for them. Partly as a result 
of his work, I was getting disillusioned with my own conjecture, so I asked him what 
was happening with his attempted proof of multi-monopole solutions. Taubes replied 
that he was stuck on that, and not sure whether they existed. I then mentioned to 
him Erick Weinberg's parameter counting result, which strongly suggested a space of 
moduli much like that in the instanton case, where looking at deformations correctly 
suggests the existence and structure of the multi-instanton solutions. To my surprise, 
Taubes was not aware of Erick's result, and knowing it impelled him into action on 
his multi-monopole proof. Within a week or two he had completed a proof, and 
wrote it up on his return to Harvard. (Thus, there was a parallel to what happened 
a year before with respect to solutions of the first order Ginzburg-Landau equations. 
In that case, Taubes had heard a lecture at Harvard by Erick Weinberg on param- 
eter counting for multi-vortex solutions (written up as Weinberg, 1979a) and then 
went home and came up with his existence proof for multi- vortices, Taubes (1980). 
The vortex work provided the initial impetus for Taubes' turning to the monopole 
problem.) In his paper Taubes showed that "for every integer N ^ there is at 
least a countably infinite set of solutions to the static SU (2) Yang-Mills-Higgs equa- 
tions in the Prasad-Sommerfield limit with monopole number N. The solutions are 
partially parameterized by an infinite sublattice in Sn{R^), the AT-fold symmetric 
product of and correspond to noninteracting, distinct monopoles." This quote 
is taken from the Abstract of his preprint "The Existence of Multi-Monopolc Sohi- 
tions to the Static, SU (2) Yang-Mills-Higgs Equations in the Prasad-Sommerfield 
Limit" , which was received on the SLAG Spires data base in June, 1980, and which 
carried an acknowledgement on the title page noting that "This work was completed 
while the author was a guest at the Institute for Advanced Studies, Princeton, NJ 
08540". His preprint also ended with an Acknowledgment section noting his con- 
versations with me, with Arthur Jaffe, and with Karen Uhlenbeck, as well as the 
Institute's hospitality. The proof was not published in this form, however, but in- 
stead appeared (with acknowledgments edited out at some stage) as Chapter IV of 
the book Jaffe and Taubes (1980) that was completed in August of 1980. The mul- 
timonopole existence proof was a milestone in Taubes' career; in a recent exchange 
of emails relating to the events described in this section, Taubes commented on his 
visit "to hang out at the IAS during the spring of 1980. It profoundly affected my 
subsequent career...". He went on to further investigations of monopole solutions, 
that lead him to studies of 4-manifold theory which have had a great impact on 
mathematics. 

O'Raifeartaigh, who had been following the monopole work at a distance, invited 
me during the spring of 1980 to come to Dublin that summer to lecture on my papers. 
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However, since Taubes had much more interesting results I suggested to Lochlainn 
that he ask Chfford instead, and Taubes did go to Dubhn to lecture. After Clifford's 
visit, I redirected my search for semiclassical confinement models to a study of 
nonlinear dielectric models by analytic and numerical methods, in collaboration 
with Tsvi Piran; these models do give an interesting class of confining theories, and 
are described in the following section. Based on the observation that the Yang-Mills 
action is multiquadratic (that is, at most quadratic in each individual potential 
component), Piran and I also applied the same numerical relaxation methods to 
give an efficient method for the computation of axially symmetric multimonopole 
solutions. (This was done mainly to illustrate the computer methods, since by then 
exact analytic 2-monopolc solutions had appeared; see Forgacs, Horvath and Palla 
(1981) and Ward (1981).) The numerical methods that Piran and I developed were 
described in our Reviews of Modern Physics article Adler and Piran (1984), R47 
that marked the completion of the research program on confining models, and as a 
by-product, on monopoles. 



Confinement Models 

Having seen that monopole backgrounds would not confine, I turned my attention 
to another type of semi-classical model, proposed in various forms by Savvidy (1977) 
(see also Matinyan and Savvidy (1978)) and Pagels and Tomboulis (1978). The basic 
idea is to do electrostatics with Abelianized quark charges, and with the fundamen- 
tal QCD action replaced by a renormalization group improved effective action, in 
which the gauge coupling is replaced by a running coupling, that is taken to be a 
function solely of the field strength squared. Although use of the running coupling 
is only justified by the renormalization group in the ultraviolet regime of large field 
strengths, the model assumes that the same functional form can be extrapolated to 
small field strengths as well. This leads to electrodynamics with a nonlinear, field- 
dependent dielectric constant that develops a zero for small squared field strengths. 
Because the only dynamical input from QCD is the running coupling, the model, as 
Prank Wilczek later remarked to me, can be considered as a very simple embodiment 
of the idea that "asymptotic freedom" should be associated with "infrared slavery" . 
Since the running coupling involves a scale mass, the model directly incorporates 
the phenomenon of dimensional transmutation. Pagels and Tomboulis conjectured, 
on the basis of various evidence, that the nonlinear dielectric model would confine, 
but did not have a proof. 

In the paper Adler (1981), R48, I analyzed the effective action model in detail 
and proved that it confines quarks. The argument starts from a Euclidean form 
of the Feynman path integral, and shows that the static potential is the minimum 
of the effective action in the presence of sources. I then specialized to the leading- 
logarithm effective action, and showed that the action minimum is associated with a 
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field configuration in which a color magnetic field fills in whenever the color electric 
field is less than the minimum magnitude k at which the effective action is minimized. 
This reduces the action minimization to an electrostatics problem, to which one can 
apply flux conversation estimates due to 't Hooft (1974b). In the nonlinear dielectric 
model context, these estimates show that the static potential is bounded from below 
by kQ{R — r), with Q the Abclianized quark charge, with r a constant, and with R 
the interquark separation. Hence the potential increases linearly for large i?, and the 
model confines. In an Appendix to R48, I discussed how a one-loop renormalization 
group exact, leading-logarithm running coupling can be obtained, by a coupling 
constant transformation, from the more usual two-loop renormalization group exact 
running coupling (to which the confinement argument also applies). 

When I presented this proof of confinement by the nonlinear dielectric model 
at a Department of Energy sponsored workshop in Yerevan, Armenia in 1983, an 
interesting dialogue with the Soviet physicist A. B. Migdal ensued. When I started 
to talk, and said what I was going to prove, Migdal stood up and stated that it was 
well-known that the Savvidy (-Pagels-Tomboulis) model did not confine, and gave 
some reasons. I then presented my proof, after which Migdal stood up, and said 
words to the effect that the problem is that there are too many confining models! 
As we shall sec, there is really only one other model, the "dual superconductor" 
model, which like the nonlinear dielectric model is motivated by the idea of a color 
magnetic condensate, but describes this with a different dynamics. 

Following publication of R48, I wrote a paper (Adler, 1982a) formalizing the 
approximations (further discussed below) needed to get an Abclianized effective 
action model from the functional integral for QCD. I then turned to the problem 
of understanding in detail how the leading-log model gives a confining potential. 
Since it was clear that this would, at least in part, involve numerical solution of the 
nonlinear differential equations involved, I brought in Tsvi Piran as a postdoc. Tsvi 
had worked extensively in the numerical solution of the Einstein equations of general 
relativity, and came to the IAS with the understanding that he would continue this 
and other interests he had in astrophysics, but would also collaborate with me in the 
numerical solution of the leading-log model equations. Because of my work on the 
induced gravity program, this collaboration didn't start immediately after Tsvi's 
arrival, but once we began work, Tsvi taught me a great deal about setting up an 
interactive program to numerically solve partial differential equations. As is typical 
in doing numerical work, most of our time was spent developing and testing our 
computer codes, which took many months. To guard against programming errors, 
Tsvi and I each independently wrote our own programs, which once debugged gave 
identical results. The final production runs took a total of less than two days running 
time on the then new IAS VAX 11/780 computer, using mesh sizes of up to 100 x 100 
to resolve details of the confinement domain. 

The equations to be solved, in the leading-log model with three light fermion 
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flavors and scale mass k, are 

f = QS{x)S{y)[S{z - a) - S{z + a)] , 

e(E) = i6olog(£'A') , E=\E\ , 
, 9 

We also studied the leading-log-log model, in which the two-loop exact form of the 
running coupling is used. We originally tried to solve the equations directly in terms 
of the scalar potential A^, but found that the numerical programs were unstable. 
I then introduced a flux function reformulation of the problem (suggested by sim- 
ilar methods used in plasma physics), and this gave a stable, rapidly convergent 
iteration showing formation of a flux-confining free boundary. To understand the 
structure of the free boundary, Tsvi suggested that a paper of Fichcra on elliptic 
equations that degenerate to parabolic would be relevant, and this indeed was the 
CcLSC, clS described in Appendix A of our review R47. Prior to writing the review, we 
wrote two shorter papers. The first (Adler and Piran, 1982a, R49) demonstrated flux 
confinement and gave a numerical determination of the large R asymptotic form of 
the interquark potential, which contains a leading term linear in R, and a subdom- 
inant term proportional to log k^^^R. The second (Adler and Piran, 1982b, R50) 
gave compact, accurate functional forms that fit the computed static potentials for 
both the leading-log and the leading-log-log models. 

One nice feature of the leading-log model (as well as the Icading-log-log exten- 
sion) is that its small distance and large distance limiting cases can be approximated 
analytically. In the small distance limit, I devised an analytic perturbation method 
(Adler, 1982b) which shows that the potential has the standard form of a Coulomb 
potential with a logarithmic correction that is expected from perturbative QCD, 
permitting the parameter k of the model to be related to the QCD scale mass. With 
this identification, the model has no adjustable parameters. In the large distance 
limit, an ingenious analysis by Lehmann and Wu (1984) showed that the confine- 
ment domain is an ellipsoid of revolution, with maximum diameter growing as i?^/^ 
with the interquark separation, and gave an analytic expression for the free bound- 
ary shape for large R as well as the subdominant term in the potential. Thus, the 
model yields a "fat" bag, rather than a cylindrical confinement domain of uniform 
radius; however, Lehmann told me at the time that he believed the true QCD be- 
havior would show a constant-radius cylindrical domain, and he appears now (see 
below) to be right. As discussed in the articles I wrote with Piran, the analytic 
forms for both small and large R agreed very well with our numerical results, giving 
confidence that the numerical analysis had been carried out correctly. 

How well do the nonlinear dielectric models agree with QCD? There are two 
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aspects to this question, whether they give satisfactory static potentials, and whether 
they describe the flux conflnement domain that is reahzed in QCD. To assess the 
static potentials tabulated in R50, one has to do a detailed fit to heavy quark 
spectroscopic data. This was done in papers of Margolis, Mendel, and Trottier (1986) 
and of Crater and Van Alstine (1988), both of which concluded that the log-log model 
potential is in good agreement with experimental data on heavy quark systems, with 
reasonable values of the quark masses. The fit of Margolis, Mendel, and Trottier used 
a value of ^ms — 0.270 GeV, while that of Crater and Van Alstine used a value of 
h^Ms = 0.215GeV (note that their A is the k^/^ of R50, which is related to A^^ 
by = 0.959k^/^). These values of are in reasonable accord with the value 
^MS ~ 0.218 GeV that Piran and I had quoted in R50, obtained by requiring the best 
fit of our potential to Martin's phenomenological potential for heavy quark systems. 
These values of Aj^^ should be compared with the three light quark experimental 
value ~ 0.369 GeV (Hinchliffe, 2005). For a simple extrapolation from the 

asymptotically free regime to the confining regime of QCD, the nonlinear dielectric 
model does reasonably well in accounting for heavy quark spectroscopy. 

As already noted, the confinement domain in the nonlinear dielectric models is 
an ellipsoid of revolution, with width increasing with the quark separation R. Let 
p be the cylindrical radial coordinate, and z the coordinate along the axis of the 
cylinder. On the medial plane z = 0, various quantities of interest can be computed 
in the large- i? limit directly from the Lehmann-Wu asymptotic solution. The radius 
of the confinement domain on the medial plane is 

p. = h4(^)* . 

and the value of on the medial plane is 

from which one can check that the flux integral gives 27r Jq"" pdp\D\ = Q. The proflle 
of \D\ is evidently parabolic, and scales with oc i?2. 

To compare this "fat bag" confinement domain with QCD, one must rely on lat- 
tice simulations, since in real- world QCD, the confining flux tube breaks up through 
quark-antiquark pair formation before the asymptotic regime is reached. Assuming 
that the lattices used are large enough to accurately approximate the continuum 
theory, the data from simulations that have been carried out show a confinement 
domain of constant diameter in the limit of large R, as discussed and referenced in 
the book of Ripka (2004) . The details of the simulated confinement domain favor the 
"dual superconductor" model, in which QCD is regarded as a dual of a Ginzburg- 
Landau superconductor, with magnetic monopole pairs replacing the Cooper pairs 
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of superconductivity. In this picture, in addition to the color fields, there is a dy- 
namical variable corresponding to the monopolc condensate. A numerical analysis 
of flux confinement in a dual superconductor, using the methods described in my 
review with Piran R47, has been given by Ball and Caticha (1988), who give plots 
of the confinement domain; for further details and references, see both Ripka (2004) 
and the review of Baker, Ball, and Zachariascn (1991). For appropriate values of 
the three dual superconductor model parameters (a magnetic charge g, which can 
be related to an effective QCD coupling e^s by the Dirac quantization condition 
g = 27r/eeff, a scalar magnetic condensate mass m^^, and a gauge gluon mass my), 
good fits to the lattice simulations are obtained, and the dual superconductor model 
also gives a phenomenologically satisfactory static potential. (In a recent preprint. 
Haymaker and Matsuki (2005) argue that in lattice comparisons, the continuum 
my gives rise to two parameters that must be fitted, making four parameters in 
all including g.) However, since the dual superconductor gives a Coulomb poten- 
tial at short distances, without logarithmic modifications, the dual superconductor 
parameters cannot be directly related to the QCD scale as was possible for 

the scale parameter k of the nonlinear dielectric model. As a limiting case, the dual 
superconductor model gives the standard bag model with a field discontinuity at the 
boundary; a numerical solution of this model is also discussed in Ball and Caticha 
(1988). 

Although the nonlinear dielectric model and the dual superconductor model suc- 
cessfully describe important aspects of confinement in QCD, major steps would be 

needed to incorporate such classical action models into a proof of confinement. To 
do so one wotild have to prove that the true energy of a widely separated quark- 
antiquark pair in QCD is bounded from below by the energy calculated in one or 
the other of the two models. This would require achieving precise control over the 
qualitative approximations involved in the models, which include a mean-field ap- 
proximation to the functional integral as discussed in Adler (1982a), the replacement 
of the exact QCD effective action by the model effective action, and replacement 
of the octet of color quark charges by Abelianized effective charges lying in the 
maximal commutative subgroup. Although, as I argued in the case of the nonlin- 
ear dielectric model in Adler (1982a), these simplifications of the full problem are 
plausible, replacing qualitative approximations by precise mathematical statements 
with error estimates will be no small task. 

In any flux tube picture of confinement based on Abelianized charges, such as 
either the nonlinear dielectric model or the bag limit of the Ginzburg-Landau dual 
superconductor model, the string tension scales as the Abelianized quark charge, 
or as the square root of the corresponding Casimir. In a paper with Neuberger 
(Adler and Neuberger, 1983, R51), we pointed out that in the large- A^c limit of 
SU (Nc) gauge theory, the string tension scales with the Casimir when changing from 
fundamental to adjoint representation quarks; hence to the extent that flux tube 
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models give a good description of confinement in A^c = 3 QCD, different confinement 
mechanisms appear to be at work in QCD and in its large Nc limit. 



Chiral Symmetry Breaking 

Not long after I had finished the review paper R47 with Piran summarizing our 
work on confinement models, Anne Davis suggested looking at another outstanding 
problem in QCD, that of chiral symmetry breaking. After studying the relevant 
literature (reviewed in the Introduction to our paper Adler and Davis (1984), R52), 
we decided to focus on setting up and solving a superconductor-like gap equation for 
fermion pairing in Coulomb gauge, systematically imposing the axial-vector current 
Ward identities to get the correct renormalization procedure. This method permitted 
us to study pairing using a Lorentz vector instantaneous confining potential with 
V cx r, getting infrared- finite results for physical quantities without imposing ad 
hoc infrared cutoffs. The model gives spontaneous breaking of chiral symmetry, but 
with values of the quark condensate {uu) and the pion decay constant that 
are considerably too low when the phenomenological confining potential (or string 
tension) is used as input. Similar results were also found by a group at Orsay, and 
we learned later that the utility of the axial-vector Ward identities in deriving the 
gap equation had also been noted by Dclbourgo and Scadron (sec the reprinted 
papers R52 and R53 for references). Extensions of the model of R52 to the finite 
temperature case were later discussed by Davis and Matheson (1984), Alkhofer and 
Ammundsen (1987), and Klevansky and Lemmer (1987). 

In a subsequent paper (Adler, 1986, R53) that I wrote for the Nambu Festschrift, 
I reviewed the work of various groups on gap equation models, and also noted a 
problem. In order for there to be no explicit breaking of chiral symmetry in the 
gap equation model, the instantaneous potential must be the time component of a 
Lorentz vector, so that it contains factors 70 that anticommute with 75. However, 
experimental data on heavy quark spectroscopy show that the confining part of 
the potential is predominantly Lorentz scalar, and using a Lorentz scalar potential 
in the gap equation model would lead to explicit violation of chiral symmetry, and 
therefore invalidate the model. This suggests that the approximations leading to the 
gap equation model are not valid for the confining part of the potential. In R53, 1 also 
gave equations that I had worked out for a retarded extension of the instantaneous 
potential model. The original intention had been for a graduate student in either 
Princeton or Cambridge to work on solving the extended model, but in view of 
the Lorentz structure problem this was not done (a covariant treatment of the gap 
equation model was later given by von Smekal, Amundsen, and Alkofer (1991)). 
For various proposals for addressing the Lorentz structure issue, see Lagae (1992), 
Szczepaniak and Swanson (1997), and Bicudo and Marques (2004). 

Shortly after the paper R52 was out, Cumrun Vafa, then a Princeton graduate 
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student, had a few conversations with me about his attempts to turn the Banks- 
Casher (1980) eigenvalue density criterion for chiral symmetry breaking into a proof 
that chiral symmetry breaking occurs in QCD. (For recent progress in applying the 
Banks-Casher criterion in the large- AT^ limit, see Narayanan and Neuberger (2004).) 
I didn't have much in the way of concrete suggestions to offer, and Cumrun started 
also talking to Edward Witten, who very sagely suggested looking at a different 
problem, that of studying whether parity conservation can be spontaneously bro- 
ken in QCD. This problem proved tractable, and their papers (Vafa and Witten, 
1984a,b), proving that parity is not spontaneously broken in vector-like gauge theo- 
ries (and similarly for the isospin and baryon number symmetries), became part of 
Vafa's thesis. The difference between the parity problem and the chiral symmetry 
problem can be understood by considering their respective order parameters. If par- 
ity is spontaneously broken, the pseudoscalar order parameter uij^u will receive a 
vacuum expectation. When the fermions are integrated out. one obtains a Lorentz 
invariant, parity-nonconserving operator functional X of the gluon fields that is real 
in Minkowski space, but picks up a factor of i when rotated to Euclidean space. 
This, together with positivity of the Euclidean space Dirac fermionic determinant 
in a vector-like theory, is the basis of the Vafa- Witten proof that adding a small mul- 
tiple of X to the action cannot make the ground state energy lower. In the chiral 
symmetry problem, the relevant order parameter is the parity conserving but chiral 
symmetry breaking scalar operator uu, which when the fermions axe integrated out 
leads to a functional X' of the gluon fields that remains real when rotated to Eu- 
clidean space. Hence the Vafa- Witten argument suggests that the energy minimum 
may lie at a nonzero value of X' , but such a local analysis cannot find the global 
minimum, and hence does not give a proof of chiral symmetry breaking. Rigor- 
ous lattice inequalities given by Weingarten (1983) give a proof of chiral symmetry 
breaking only when additional strong assumptions arc made, including the existence 
of the continuum limit and the confinement of color, together with use of anomaly 
matching conditions. 

Over twenty years later, the problem of proving the breakdown of chiral sym- 
metry in QCD is still open, as is that of proving confinement. In fact, there is 
considerable evidence that chiral symmetry breaking and confinement in QCD are 
related phenomena. For example, lattice simulations such as D'Elia et al. (2004) 
show that the deconfining and chiral transitions coincide; gap equation models of 
the type studied in R52 find chiral symmetry breaking for a confining potential but 
not for a Coulomb potential, and lattice inequalities of the type studied by Wein- 
garten also need confinement as an ingredient to show chiral symmetry breaking. 
Thus it appears that both of these outstanding problems in QCD are aspects of 
the larger problem of proving that QCD exists and has a mass gap, which is one of 
the seven Clay Foundation Millennium Problems in mathematics and mathematical 
physics. Perhaps in this century, with the added incentive of a $1 million reward, 
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rigorous proofs of confinement and chiral symmetry breaking in QCD will be found! 
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8. Overrelaxation for Monte Carlo and Other Algorithms 



Introduction 

As I have already noted, the investigations described in the previous chapter all 
involved extensive computer work. This got me interested in the issue of algorithms 
more generally, and led to two distinct research directions in the years that followed. 
One involved generalizing the acceleration methods for solving partial differential 
equations to the related problem of Monte Carlo simulations, as discussed in the first 
section that follows. The second involved neural networks and pattern recognition, 
and led among other things to work on image normalization methods, described 
briefly in the second section of this chapter. 



Overrelsixation to Accelerate Monte Carlo 

In preparation for numerically solving the partial differential equations for the 
leading-log models, I did general reading on numerical methods for handling partial 
differential equations. This taught me about the critical slowing down problem - the 
fact that as one refines meshes to get more accurate numerical solutions, the rate 
of convergence of the iterations slows down. I also learned about various strategies 
devised for defeating critical slowing down, and in particular about the successive 
over-relaxation (SOR) modification of the standard Gauss-Seidel iteration. In a 
Gauss-Seidel iteration of a positive functional, one replaces each successive variable 
by the value that locally minimizes the functional. In SOR, one builds in a system- 
atic overshoot beyond the local minimum, with the amount of overshoot tuned to 
the degree of mesh refinement, yielding more rapid convergence as a result. In the 
work of R47, Piran and I used SOR in all of our iterative solutions, and achieved 
substantial gains in convergence speed on our finest meshes. 

I became interested in Monte Carlo algorithms because it was clear that lattice 
gauge theory simulations probably would be the only way that one could study de- 
tails of the structure of the flux confinement domain in QCD. I knew from talks that 
I had heard in Princeton that there were two main Monte Carlo methods in use, the 
Metropolis method and the heat bath method, and also that the folk wisdom at the 
time was that heat bath was the best one could do, since it corresponded to "nature's 
way" of achieving thermal equilibrium. However, since the zero temperature limit of 
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heat bath just corresponds to a Gauss-Seidel iteration, which I knew could be accel- 
erated by SOR, I suspected that the conventional wisdom was wrong, and that there 
should be extensions, to Monte Carlo thermalizations, of the standard acceleration 
methods for the solution of differential equations. Since the monopole numerical 
work had brought out the fact that the Yang Mills action is multiquadratic, I de- 
cided to study this question in the simple context of multiquadratic actions, where 
the question becomes whether for quadratic actions, the SOR method for differen- 
tial equations has an extension to Monte Carlo thermalization. This is the question 
addressed in Adler (1981), R54, where I showed that SOR does indeed extend to the 
thermalization of multi-quadratic actions, by explicitly constructing in Eqs. (14a, b) 
the transition probability that obeys detailed balance when an overrelaxation pa- 
rameter is included in the iteration. (Note that the normalization factors in these 
equations have the tt in the correct place, but the other factors inverted; this is 
corrected in Eqs. (9) and (11) of my later paper R55. The argument of R54 does 
not involve the normalization factors, and is unaffected.) For this overrelaxed ther- 
malization, I showed that the means of the thermalized variables iterate according 
to standard SOR; since standard SOR accelerates Gauss-Seidel, this implies that 
there should be a corresponding acceleration of the thermalization process as well. 

The 1981 paper R54 gave the earliest indication that Monte Carlo methods could 
be accelerated to improve critical slowing down, and for this reason was conceptu- 
ally important, as well as having later applications and extensions. To the best of 
my knowledge, I am supported by the literature on the subject, in stating that R54 
first introduced acceleration methods into Monte Carlo. Two compilations of Monte 
Carlo articles edited by Binder contain literature siuvcys, Binder ct al. (1987), and 
Swendsen, Wang, and Ferrenberg (1992), relating to the critical slowing down prob- 
lem. In both surveys, the earliest listed reference is from 1983; neither survey cites 
my 1981 paper (or Whitmer's 1984 paper - see below), although some of the cited 
articles do reference these papers. 

I didn't immediately continue work on Monte Carlo acceleration myself, but 
suggested it as a thesis research area to my Princeton University graduate student 
Charles Whitmer. He applied the method to (f)'^ and Higgs actions that are point split 
on a lattice with unit displacement jl according to (j)^{x) 0^(x)0^(a: -|- ajl), which 
makes them multiquadratic, and in the paper Whitmer (1984) reported improvement 
over conventional heat bath Monte Carlo. I got interested in the subject again a few 
years later, after Goodman and Sokal (1986) (who knew about the SOR method of 
R54 and Whitmer's paper) proposed a stochastic extension of multigrid methods, 
and Creutz (1987) and Brown and Woch (1987) gave a simple implementation of 
the SOR idea for lattice gauge theory plaquette actions. This latter development 
eliminated the need for the problematic gauge fixing that I had used in R54 to 
keep the latticized gauge action multiquadratic, and opened the way to practical 
applications of SOR to gauge theory Monte Carlo studies. In the spring of 1987, 
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I went to Torino, Italy with my daughter Victoria, who had been eager to visit 
Europe after finishing her high school requirements. During this sabbatical term I 
was a visitor at the Institute for Scientific Interchange (ISI), at the invitation of 
Mario Rasetti and my former IAS colleague Tullio Regge; I also had an office at 
the University of Torino that I used a couple of days a week. Although I had been 
spending considerable time over the previous few years working on quaternionic 
quantum mechanics (see the next chapter), I decided on this trip to return to my 
old interest in Monte Carlo SOR, stimulated by the fact that experts in the Monte 
Carlo field had started to get interested. In a paper that I wrote while at ISI (Adler, 
1988a, R55) I gave a much more detailed analysis of overrclaxcd thermalization for a 
quadratic action, and also gave extensions of the method to non-quadratic actions, 
including SU (n) gauge theory. 

After my return to the IAS from this sabbatical, I continued work on Monte- 
Carlo algorithms for several more years. In a paper written in the fall of 1987 after I 
returned from Italy (Adler, 1988b, R56), I gave an elegant formal analysis showing 
that the general linear iteration u' = Mu + Nf corresponding to a splitting 1 = M + 
NL of the quadratic form L for a Gaussian action, has a corresponding stochastic 
generalization 

P{u u') = {P/TT)^/^{detr)^/^ ex.p[-{u' -Mu- Nff/3r{u' -Mu- Nf)] , 

with r = ^{L~^ — ML~^M'^)~^ a modified temperature matrix. This extends the 
SOR thermalization of R54 to a general linear iterative process. Later in the same 
academic year, I gave in Adler (1988c) a Metropolis variant of the SU{n) method 
given in R55, that extended the method for the Wilson action used by Creutz and by 
Brown and Woch to general overrelaxation parameter lo. In collaboration with Cyan 
Bhanot, a former IAS member and a Monte Carlo expert, we made a numerical study 
of the SU (2) version of this algorithm, with results reported in Adler and Bhanot 
(1989), R57. (Growing out of this collaboration, Bhanot spent several years as a half- 
time member of the IAS in the early 1990's, in the course of which we wrote a number 
of further papers on a variety of Monte Carlo acceleration methods.) I also gave talks 
at lattice conferences; at the biennial Lattice Gauge conference Lattice 88, held at 
Fermilab that year, I gave a plenary talk reviewing work on algorithms for pure 
gauge theory, focusing primarily on the theory and application of overrelaxation 
methods (Adler, 1989, R58). Monte Carlo overrelaxation has become a standard 
part of the lattice gauge theorist's tool kit; for a sampling of recent applications, see 
Kiskis, Narayanan, and Neuberger (2003), Holland, Pepe, and Wiese (2004), Meyer 
(2004), Pepe (2004), Shcheredin (2005), and de Forcrand and Jahn (2005). 
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Image Normalization 

During the 1990s, I interspersed my work on quaternionic quantum mechanics and 
particle physics with work on aspects of neural networks and pattern recognition. 
My neural network interests involved an analog device that I patented (Adler, 1993) 
and an article (Adler, Bhanot, and Weckel, 1996) analyzing its algorithmic aspects. 
In pattern recognition, from lunchtime conversations with Joseph Atick and Norman 
Redlich, I got interested in the problem of extracting those features of an image that 
are invariant under a symmetry transformation. This problem is closely analogous 
to that of extracting those features of a gauge potential that are gauge-invariant, 
and in Adler (1998), R59 I gave a general formal solution, based on imposing image 
normalizing constraints analogous to gauge-fixing constraints. I have reprinted here 
only the first two sections of this unpublished article (without references), in which 
the general theory is set up; further sections of the article give applications to a 
variety of viewing transformations of a planar object. Shortly afterwards, when one 
of the IAS string theory postdocs was interested in switching to a computer-related 
career, I suggested applying my methods to the problem of the similarity and affine 
normalization of partially occluded planar curves (such as the boundary of a planar 
object). We worked this out together and it was published as Adler and Krishnan 
(1998), R60. The excerpt R59 of the general paper that is reprinted here gives the 
background needed to follow the extension of the planar algorithm to curve segments 
given in R60. 
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9. Quaternionic Quantum Mechanics, Trace Dynamics, and 
Emergent Quantum Theory 



Introduction 

During the twenty years from 1984 to 2004, a large part of my time was spent on 
investigations into foundational areas of quantum mechanics. Most of my research 
from this period was later presented in two books that I wrote, Quaternionic Quan- 
tum Mechanics and Quantum Fields (Oxford University Press, New York, 1995) 
and Quantum Theory as an Emergent Phenomenon (Cambridge University Press, 
Cambridge, 2004). I have not included in this reprint volume any research papers 
incorporated (some considerably improved) into the two books, since this would 
be infcasiblc because of length limitations. So what 1 discuss in this chapter arc a 
few papers dealing with quaternionic topics written during the period between the 
two books, together with a brief description of how I got interested in quaternionic 
quantum theory, and later on, in the possibility of a pre-quantum theory. 

Quaternionic Quantum Mechanics 

My interest in quaternionic quantum mechanics grew out of my interest in the Harari 
(1979)-Shupe (1979) model for composite quarks and leptons. They postulated an 
order-dependence for the preon wave functions (e.g., TTV, TVT, VTT were con- 
sidered to be three distinct color states) , which suggested that quantum theory over 
a noncommutative field might be involved. I was never able to use quaternions or 
related ideas to make a successful preon model, either during the period before my 
book (see Adler, 1979, 1980) or after (Adler, 1994a), but the issues raised, and 
interactions with key people acknowledged in the Preface of the 1995 volume, led 
me to undertake a systematic study of quaternionic quantum mechanics. Perhaps 
the most important new result contained in my papers (Adler, 1988) and in my 
book is the fact that the S'-matrix in quaternionic scattering theory is complex, 
not quaternionic, which was a surprise to the experts in the field and invalidated 
proposed searches (such as Peres, 1979) for quaternionic effects manifested through 
noncommuting scattering phases. I also clarified the relationship between time rever- 
sal symmetry in quaternionic quantum theory (where it is unitary) and in complex 
quantum theory (where it is antiunitary) , proved that positive energy quaternionic 
Poincare group representations are complex and not intrinsically quaternionic, and 
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gave a quaternionic generalization of projective group representations (to which I 
shall return shortly). These were but a few of the many topics dealt with in my 
1995 book. My quaternionic investigations also motivated work I did in new direc- 
tions in standard quantum mechanics, such as a paper that I wrote showing that 
SU{3) X SU{12) is the minimal grand unified theory in which, species by species for 
charged fermions, no Dirac sea is required (Adler, 1989). 

After my book on quaternionic quantum mechanics was completed, a number 
of papers that I wrote with collaborators clarified issues that were left unresolved, 
or were inadequately treated, in the book. One of these issues dealt with the non- 
adiabatic geometric phase in quaternionic Hilbert space. This was discussed in my 
book, but on a visit to the IAS, Jeeva Anandan pointed out that my treatment 
was incomplete, and sketched what was needed to improve it. I filled in the details 
and drafted a manuscript, which became a joint paper (Adler and Anandan, 1996, 
R61) that was published in the Larry Horwitz Festschrift issue of Foundations of 
Physics. A second issue that was left hanging was the analog of coherent states 
in quaternionic quantum theory. My thesis student Andrew Millard and I studied 
this, and wrote a paper (Adler and Millard, 1996a, R62) giving the extension of 
the Perelomov coherent state formalism to quaternionic Hilbert spaces. We also 
showed that the closure requirement forces an attempted quaternionic generalization 
of standard coherent states based on the Weyl group to reduce back to the complex 
case, settling a question raised in discussions with me by John Klauder. The other 
issues that were dealt with after publication of the quaternionic book were the 
structure of quaternionic projective representations, and the relationship between 
standard complex quantum mechanics and the dynamics based on a trace variational 
principle that I had introduced in the field theory chapter of the 1995 book. These 
form the subject of the next two sections. 



Quaternionic Projective Group Representations 

Given two group elements b, a with product ba, a unitary operator representation Uh 
in a Hilbert space is defined hy UbUa = Uba- more general type of representation, 
called a ray or projective representation, is relevant to describing the symmetries of 
quantum mechanical systems. In his famous paper on unitary ray representations 
of Lie groups, Bargmann (1954) defines a projective representation as one obeying 
UbUa = Uba^{b, a), with Lo{b, a) a complex phase. 

This definition is familiar, and seems obvious, until one asks the following ques- 
tion: Bargmann's definition is assumed to hold as an operator identity when acting 
on all states in Hilbert space. However, we know that it suffices to specify the 
action of an operator on one complete set of states in Hilbert space to specify 
the operator completely. Hence why does one not start instead from the definition 
UbUa\f) = Uba\f)(^{f',b,a), with {|/)} one complete set of states, as defining a pro- 



February 2, 2008 2;58 Trim Size: 10.25in x 7.5in — Adventures in Theoretical Physics archive4 



Quantum Mechanics, Trace Dynamics, and Emergent Quantum Theory 109 

jective representation in Hilbcrt space? Let us call Bargmann's definition a "strong" 
projective representation, and the definition with a state-dependent phase a "weak" 
projective representation. Then the question becomes that of finding the relation 
between weak and strong projective representations. 

Although I have formulated this question here in complex Hilbert space, it arose 
and was solved in the context of quatcrnionic Hilbcrt space, where the phases 
Lo{f; b, a) are quaternions, which obey a non-Abelian group multiplication law iso- 
morphic to 50(3) ~ SU{2). The strong definition was adopted for the quaternionic 
case by Emch (1963, 1965), but in Sec. 4.3 of my book on quaternionic quantum 
mechanics I introduced the weak definition in order for quaternionic projective rep- 
resentations to include embcddings of nontrivial complex projective representations 
into quaternionic Hilbert space; the state dependence of the phase is necessary be- 
cause even a complex phase uj does not commute with general quaternionic rephas- 
ings of the state vector |/). I noted in my 1995 book that the weak definition can 
be extended to an operator relation by defining 

n{b,a) = Y,\fMf-,b,a){f\ , 

/ 

so that the weak definition then takes the form 

UbUa = Ubanib,a) , 

which gives the general operator form taken by projective representations in quater- 
nionic quantum mechanics. I also introduced in Sec. 4.3 of my book two special- 
izations of this definition, motivated by the commutativity properties of the phase 
factor in complex projective representations. I defined a multicentral projective rep- 
resentation as one for which 

[n{b,a),Ua] = [n{b,a),Ub]=0 

for all pairs b, a (note that in Eq. (4.51a) of my book, Uab should read Ui,a, so that the 
two conditions just given suffice), and I defined a central projective representation 
as one for which 

[n{b,a),Uc] = 

for all triples a, b, c. 

Subsequent to the completion of my book, I read Weinberg's first volume on 
quantum field theory (Weinberg, 1995) and realized, from his discussion in Sec. 2.7 
of the associativity condition for complex projective representations, that there must 
be an analogous associativity condition for weak quaternionic projective represen- 
tations. I worked this out (Adler, 1996, R63), and showed that it takes the operator 
form 

U-^n{c, b)Ua = n{cb, a)-'^n{c, ba)n{b, a) , 
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which by the definition of $7(6, a) shows that U~^Cl{c,b)Ua is diagonal in the basis 
{!/)}, with the spectral representation 

U-'n{c, h)Ua = J2 \fMf;cb,aMf; c, baMf; b, a){f\ . 

f 

On the basis of some further identities, I also raised the question of whether one can 
construct a multicentral representation that is not central, or whether a multicentral 
representation is always central. 

Subsequently, I discussed the issues of quaternionic projective representations 
with Andrew Millard. He explained them to his roommate Terry Tao, a mathematics 
graduate student working for Elias Stein, and at my next conference with Andrew, 
Tao came along and presented the outline of a beautiful theorem that he had devised. 
This was written up as a paper of Tao and Millard (1996), and consists of two parts. 
The first part is an algebraic analysis based on the spectral representation given 
above, which leads to the following theorem 

Structure Theorem: Let U be an irreducible projective representation of a 
connected Lie group G. There then exists a reraying of the basis \ f) under which 
one of the following three possibilities must hold. 

(1) U is & real projective representation. That is, uj{f; b, a) = u{b, a) is independent 
of I/) and is equal to ±1 for each b and a. 

(2) U is the extension of a complex projective representation. That is, the matrix 
elements {f\Ua\f') are complex and u!{f;b,a) = uj{b,a) is independent of |/) 
and is a complex phase. 

(3) U is the tensor product of a real projective representation and a quaternionic 
phase. That is, there exists a decomposition Ua = U^J2f \ f)'^a{f\, where the 
unitary operator has real matrix elements, aa is a quaternionic phase, and 
Ug = ±UgU^ for all b and a. 

Prom the point of view of the Structure Theorem, case (1) corresponds to the 
only possibility allowed by the strong definition of quaternionic projective represen- 
tations, as demonstrated earlier by Emch (1963, 1965), while case (2) corresponds to 
an embedding of a complex projective representation in quaternionic Hilbert space, 
the consideration of which was my motivation for proposing the weak definition. Spe- 
cializing the Structure Theorem to a complex Hilbert space, where case (3) cannot 
be realized, we see that in complex Hilbert space the weak projective representation 
defined above implies the strong projective representation; hence no generality is 
lost by starting from the strong definition, as in Bargmann's paper. 

The second part of the Tao-Millard paper is a proof, by real analysis methods, 
of a Corollary to the structure theorem, stating 
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Corollary 1: Any multicentral projective representation of a connected Lie group 
is central. 

This thus solved the question of the relation of centrality to multicentrality that 
I raised in my paper R63. 

Subsequent to this work, I had an exchange with Gerard Emch in the Journal of 
Mathematical Physics debating the merits of the strong and weak definitions. After 
a visit to Gainesville where we reconciled differing notations, we wrote a joint paper 
(Adler and Emch, 1997, R64) clarifying the situation, and reexpressing the strong 
and weak definitions in the language and notation often employed in mathematical 
discussions of projective group representations. 



Trace Dynamics and Emergent Quantum Theory 

My work on emergent quantum theory arose from the merging of two lines of 
thought. The first line of thought arose from answering the question of whether 
quaternionic quantum mechanics ameliorates the measurement problem of standard 
quantum mechanics; the answer is "no" , because quaternionic quantum theory still 
has a unitary time evolution, and so the usual problems persist. However, in the 
course of working this through I read some of the literature on the measurement 
problem in standard quantum theory, and came away convinced that there were real 
issues to be addressed. The second line of thought arose from my attempts to con- 
struct quaternionic quantum field theories. I found that the canonical quantization 
method could not be extended to the quaternionic case, and so I had to resort to 
an alternative formalism, which I variously called "generalized quantum dynamics" , 
"total trace dynamics", or finally, simply "trace dynamics", to generate operator 
equations without "quantizing" a classical theory. This was done by using a vari- 
ational principle based on a Lagrangian constructed as a trace of noncommuting 
operator variables, making systematic use of cyclic permutation under the trace op- 
eration. These ideas were developed in the paper Adler (1994b) and were described 
in Chapter 13 of my 1995 book; in Chapter 14, I suggested that the nonlinearity 
of trace dynamics could make it relevant for resolving the measurement problem 
in quantum theory. However, the problem of relating the trace dynamics formal- 
ism to the standard canonical formalism of complex quantum field theory remained 
unsolved. 

One of the questions I had posed to Andrew Millard was that of better under- 
standing trace dynamics, in the hope of finding a connection to standard quantum 
theory. After I arrived in Aspen in the summer of 1995, Andrew sent me a memo 
containing his discovery that in trace dynamics with a Weyl symmetrized Hamilto- 
nian and noncommuting boson degrees of freedom q^, Pr, the operator YlA^r^Pr] is 
conserved. I soon found that the generalization to include fermions is the conserved 
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operator that we denoted by (7, defined by 

= ^ [qr,Pr] - XI i^r,Pr} , 
r, boson r, fermion 

and that this operator is conserved as long as the trace Hamiltonian has no fixed 
operator coefficients, which is equivalent to saying the the trace Hamiltonian has 
a global unitary invariance. It then seemed natural to suggest that the equiparti- 
tioning of C in a statistical thermodynamical treatment would provide the missing 
connection between trace dynamics and standard quantum mechanics. 

The implementation of this idea was published in Adler and Millard (1996b), and 
I developed it further over the following years with many collaborators, as described 
in Sec. 5 of the "Introduction and Overview" that opens my 2004 book on emergent 
quantum theory. This book, which is set within the framework of complex Hilbcrt 
space, gives a complete, self-contained development of trace dynamics as a (non- 
commutative) dynamics underlying quantum theory. Prom the statistical mechanics 
of this underlying theory there emerge, in a mutually complementary way, both the 
unitary and the nonunitary parts of orthodox quantum theory. The unitary part of 
quantum theory (the canonical algebra and the Hciscnbcrg representation time evo- 
lution of operators) comes from an application of generalized equipartition theorems 
in the statistical thermodynamics of trace dynamics. The nonunitary part of quan- 
tum theory, in the form of stochastic state vector reduction models from which the 
Born rule for probabilities can be derived, comes from the Brownian motion correc- 
tions to this thermodynamics. Thus, trace dynamics provides a unified framework 
from which both the unitary dynamics of quantum systems, and the nonunitary 
evolution describing state vector reduction associated with measurements, emerge 
in a natural way. 

Although quantum mechanics and quantum field theory have been the undis- 
puted basis for all progress in fundamental physics during the last 80 years, the 
extension of the current theoretical frontier to Planck scale physics, and recent en- 
largements of our experimental capabilities, may make the 21st century the period 
in which possible limits of quantum theory will be probed. My 2004 book suggests a 
concrete framework for exploration of the proposition that quantum mechanics may 
not be the final layer of fundamental theory. It also addresses the phenomenology 
of modifications to quantum theory, specifically as implemented through stochastic 
additions to the Schrodinger equation. I have continued with these phenomenolog- 
ical studies since completion of the book; my most recent papers (Bassi, Ippoliti, 
and Adler, 2005; Adler, Bassi, and Ippoliti, 2005; Adler, 2005) have dealt with ana- 
lyzing possible tests of stochastic localization theories in nanomechanical oscillator 
and gravitational wave detector experiments. 
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10. Where Next? 



In looking back at my work, I see one pattern that is repeated over and over. Many 
of the most interesting research results that I have obtained were unanticipated 
consequences of other, quite different research programs. In the course of detailed 
calculations, or speculative explorations, I noticed something that seemed worth 
pursuing, even though tangential to my original motivations, and this new direction 
ended up being of much greater interest. This happened with my calculations of 
weak pion production, which led as spin-offs to the forward lepton theorem, the 
neutrino sum rule, and soft pion theorems. It happened again with my exploration 
of gauging of the axial- vector current as an explanation for the muon mass, which led 
to anomalies. My interest in an eigenvalue condition in QED led to the calculation 
of photon splitting, and later on to an improved method for analyzing collider data. 
My attempts at a composite graviton led to an investigation of Einstein gravity as 
a symmetry breaking effect. My interest in the (spurious) Argonne threshold events 
induced me to extend my weak pion work to neutral currents, which contributed to 
the first unique determination of the electroweak couplings by Abbott and Barnett. 
My attempt to relate monopole background fields to confinement played a role in the 
multimonopole existence proof of Taubes. My computational experience in solving 
effective action confinement models led to overrelaxation as an acceleration method 
for Monte Carlo. And most recently, my interest in composite models for quarks 
and leptons led to a long exploration of the fundamentals of quantum theory, first 
through my study of quantum theory in quaternionic Hilbert space, and growing 
out of that, through my development of trace dynamics as a possible pre-quantum 
theory. 

I think this pattern is no accident, but rather a reflection of my guiding philoso- 
phy in doing research, which has been that it is more important to start somewhere, 
even with a speculative idea or an apparently routine calculation, than to sit around 
waiting for an "important" idea. Once immersed in the nitty-gritty of an investiga- 
tion, things have a way of appearing, that often lead off in very fruitful directions. 
So given this, when I look ahead, I can only say the following: I have some rough 
ideas as to where I would like to start in new explorations in fundamental theory 
and particle phenomenology, but I cannot say where these may ultimately lead, in 
the course of my continuing adventures in theoretical physics. 



